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ESTELLE R. JORGENSEN 
McGill University 


An Analysis of Selected Music Supervisor 
Tasks in Canada 


This article reports findings of a baseline questionnaire survey of music 
supervision in Canada with respect to music supervisor tasks. A profile of music 
supervisor tasks is described and comparisons by geographic region in Canada, 
music supervisor type, and jurisdiction size type are made. The implications for 
a theoretical model of music supervisor function are then discussed. 


An important aspect of research in school music administration is the development 
of a theoretical formulation which addresses fundamental questions with respect to the 
administrative role in school music and the relationship between school music 
administration and the instructional process. Research in school music administration 
has not yielded conclusive evidence upon which such theoretical formulation may be 


based. 


Several recently published studies of the music supervisor role may be noted. Ina 
time-use survey of music supervisors in California, McQuerrey (1972) found a 
divergence among music supervisor roles. Dawson (1972) presented largely 
inconclusive evidence respecting the essential role characteristics leading to successful 
music programs in California schools. Heller and Quatraro (1977), in investigating the 
relationship between the perceptions of role expectations and the perceived 
performance of the music coordinator for a sample of school districts in New York 
State, found a divergence among music staff, music coordinators, school board 
presidents and school board officers as to the perception of the role of music 
coordinator. They also found that in four areas, i.e., routine, staffing, curriculum, 
research and evaluation, the coordinators’ performance did not meet the level of 
expectation established. The order in which the supervisors came closest to fulfilling 
the level of expectation established was: (1) routine, (2) staffing, (3) curriculum, (4) 
research and evaluation. 

The objective of the study presently reported was an analysis of selected music 
supervisor tasks in Canada, in particular, comparisons by geographic region in Canada, 
music supervisor type, and jurisdiction size type. This was perceived as the first step in 
the process of generating a theoretical model of music supervisor function. 


Dr. Jorgensen is Associate Professor of Music and Chairman of the Department of School Music, McGill University, 
Montreal, Canada. Her published work focuses on aspects of school music administration and on theoretical issues in school 
music. She is a member of the executive committee of the Canadian Music Research Council and is currently involved in 
funded historical and theoretical projects in school music research. 


107 


E. R. Jorgensen 


Procedures 


This study is based on data from a questionnaire survey of music supervisors in 

Canada listed by Laughton (1974) and conducted in 1977 (Jorgensen, 1980). 

Twenty-nine music supervisor tasks were categorized as follows: 

1. Public Relations: Those tasks involving the relationship of the school music 

program with the community (both community involvement in the music 

program and community outreach by the school music program) as well as the 
relationship of the school music program to school board administrators and 
teachers. 

Administrative: Those organizational tasks imvolving centralization or 

decentralization of the school music operation, budgeting and financing of school 

music programs, among others. 

3. Consultative: Those tasks involving the conducting of workshops, the provision 
of teacher resources and inservice training, teacher counselling, and the 
advising of schools regarding facilities, scheduling problems, among others. 

4 Evaluative: Those tasks involving the selection of teachers for employment and 
for dismissal, the evaluation of school music programs, materials, equipment, and 
procedures, and the monitoring of student musical achievement, among others. 


Tasks were originally derived from Weyland’s list (Weyland, 1968) and subsequent 
additions and deletions were made following the writer’s pilot study of music 
supervision in the Berrien County, Michigan (Jorgensen, 1970). Subsequently, the 
instrument was refined following its presentation to a sample of selected music 
educators, educational administrators, sociologists, educational theorists, and two 
doctoral classes in educational administration at the University of Calgary. 

Respondents completed the checklist of 29 items with a “yes” or “no” as 
appropriate, indicating whether or not it was their function to undertake each itemized 
task. 


bo 


Questionnaire responses were categorized as follows: 
1. Musicsupervisor: The chief administrative officer responsible for school music in 
a given jurisdiction. 
Music supervisor - Type I: A music supervisor exercising function with respect to 
both the hiring and dismissal of music teachers. 
Music supervisor - Type II: A music supervisor exercising function with respect to 
either the hiring or dismissal of music teachers. 
4. Music supervisor - Type II: A music supervisor exercising no function with 

respect to either the hiring or dismissal of music teachers. 

5. Jurisdiction: The school board or province (state) over which the supervisor 
administers the school music program. 
Jurisdiction - Size Type I: A jurisdiction having a minimum of 29 music specialists 
(a minimum of 11 secondary school specialists), a minimum of 1,900 students in 
secondary school music programs, and music specialists at both the elementary 
and secondary levels. 
7. Jurisdiction - Size Type Il: A jurisdiction having 5 and less than 11 music 
specialists at the secondary level, and a minimum of 500 students in secondary 
school music programs. 
Jurisdiction - Size Type III: A jurisdiction having less than 5 secondary school 
music specialists, and less than 500 students in secondary school music programs. 


bo 
; 


Oo 
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9. Geographic Region: A tract of country with more-or-less defined boundaries or 
characteristics defined by politico-geographic criteria. Five regions in Canada 
were identified as follows: British Columbia, the Prairie Provinces, Ontario, 
Quebec, and the Maritime Provinces. All ten provinces were included in the 
study. 


Results 


A profile of music supervisor tasks categories is shown in Table 1, as are the mean 
number of Yes responses to the 29 Yes/No checklist questions of music supervisor tasks 
and the standard deviation for each task category. The highest meanof Yes responses 
was found in the public relations task category and the lowest mean of Yes responses 
and the highest standard deviation were found in the evaluative task category. While 
the means for public relations, administrative, and consultative task categories were 
grouped together, the mean for the evaluative task category was lower, resulting in a 
statistically significant difference in means (p<0.05). The variation of Yes responses 
within the consultative task category was the lowest of the four music supervisor task 
categories. 


TABLE 1 
PROFILE OF SELECTED MUSIC SUPERVISOR TASK CATEGORIES 


SSMS 


x 


Dad = Ss 
Music Supervisor Task Categories "Yeu™ Eespomses S.D. 
Public Relatious 66.33 11.68 
Administrative 59.17 10.87 
Consultative 65.42 $.37 
Evaluative 44,25 21.89 


Note. Number of respondents = 79 


The ranks assigned by the music supervisors to supervisor task categories in 
order of importance they attached to each are shown in Table 2. A Kendall 


Coefficient of Concordance was computed (W=0.03; p>0.05); no statistically 
significant level of concordance among supervisor rankings across supervisors was 
found. 


TABLE 2 


RANKS ASSIGNED TO TASK CATEGORIES BY MUSIC SUPERVISORS 


Music Supervisor Task Categories Assigned Rank 
Public Relations 3 
Administrative 2 
Consultative I 
Evaluative 4 


(Ww = 0.03; p > 0.05) 
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A comparison of selected music supervisor task categories by geographic region 
in Canada is shown in Table 3. No statistically significant difference by geographic 
region was found for any of the comparisons (p>0.05). 


TABLE 38 


COMPARISON OF SELECTED MUSIC SUPERVISOR TASK CATEGORIES 
BY GEOGRAPHIC REGION IN CANADA 


Music Supervisor Task Categories x2 df 
Public Relations 50 8 
Administrative 8.02 20 
Consultative SSS) 44 
Evaluative 15.66 28 


Note. All comparisons were not significant (p>0.05) 


A comparison of selected music supervisor task categories by music supervisor 
type is shown in Table 4. Three comparisons were not significant (p>0.05). A 


statistically significant difference by music supervisor type was found for evaluative 
tasks (p<0.05). 


TABLE 4 


COMPARISON OF SELECTED MUSIC SUPERVISOR TASK CATEGORIES 
BY SUPERVISOR TYPE 


Music Supervisor Task Categories x2 af Pp 
Public Relations SO 8 
Administrative 1.49 10 
Consultative L207 22, 
Evaluative 42.07 10 p <0.05 


Note. Three comparisons were not significant (p > 0.05) 


A comparison of selected music supervisor task categories by jurisdiction size 
type is shown in Table 5. No statistically significant difference by jurisdiction size 
type was found for any of the comparisons (p>0.05). 
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TABLE 5 


COMPARISON OF SELECTED MUSIC SUPERVISOR TASK CATEGORIES 
BY JURISDICTION SIZE TYPE 
eases es 


Music Supervisor Task Categories x2 af 
SSS SSS SSS a ee he gd ie 
Public Relations ONS 4 
Administrative 0.60 10 
Consultative 4.36 22 
Evaluative 36 14 


Note. All comparisons were not significant (p > 0.05) 


Discussion 


The findings imply a considerable degree of homogeneity in music supervisor 
tasks in Canada despite the wide variation in geography and in the size of music 
programs administered across the country. A similarity to findings of homogeneity 
in academic and professional preparation of supervisors in Canada is evident 
(Jorgensen, 1980). The reasons for this homogeneity warrant further investigation. 
The homogeneity of academic and professional preparation and supervisor task 
profiles supports the theoretical notion of an identifiable school music supervisor 
“type” (Zentner, 1979). 

Of particular interest in this study is the low ranking accorded evaluative tasks, 
both in terms of function (as indicated by Yes responses for each task) and 
perceived importance (as indicated by supervisor-assigned ranks). These results 
lend support to the conclusion that music supervisors in Canada are viewed by 
administrators and view themselves principally as “consultants.” Generally, they 
lack or fail to use evaluative powers which might enable them to accomplish their 
objectives more effectively. The evidence supports published findings in earlier 
studies in the United States (McQuerrey, 1972; Dawson, 1972; and Heller & 
Quatraro, 1977). 

The consistency of present findings for Canada with those in the U.S.A. further 
supports the theoretical notion of a Type C politico-organizational environment in 
which local governments have executive responsibility for music education, e.g., 
U.S.A. and Canada (Jorgensen, 1980), and where music supervisor tasks may be 
expected to be essentially similar. While individual tasks may vary in detail or may 
be emphasized differently from time to time or from place to place, there are certain 
fundamental similarities in supervisor function within a_ given _politico- 
organizational environmental type. Findings in the present study suggest that in 
Type C environments, evaluative functions do not play as important a role as other 
(e.g., consultative) functions. 

While it is interesting to examine the nature of school music supervisor tasks and 
to quantify them in various ways, other questions bearing on the relationship of 
music administration to the instructional process remain. For example, it has been 
hypothesized that: 

Music education when conducted in the context of an administrative system revealing a 
consistent policy over a wide social-actual space and through an extended period of time 
is more efficient than music education conducted in the absence of such a consistent 
policy over a wide social-actual space and through an extended period of time. 


(Jorgensen, 1977, p. 211). 111 
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This principle assumes: (a) a central link between a consistent policy over the long 
term and the quality of musical instruction; and (b) that some central agency or 
individual (viz., music supervisor) be empowered to implement such a consistent 
policy. It further implies an understanding of the political process of a given 
politico-organizational environment, for example, the identification of supervisor 
profiles (both preparation and function) which will maximize supervisor 
effectiveness in a given jurisdiction. These issues warrant further research. 


TABLE 6 


A SYSTEMATIC DIAGRAM OF SYSTEM PROCESSES 
AND THEIR RELATED NORM-SETS 


Processes Norm-sets 
Temporal-spatial coordination Time-space norms 


Goal seeking 
Information-exchange 
Recruitment 
Socialization 
Personification 
Evaluation 
Differentiation 
Segmentation 
Integration 
Responsibility-allocation 
Stratification 

Role coordination 
Decision-making 
Cooperation 
Competition 

Conflict 

Motive ascription 
Image-preservation 
Maturation 
Reality-maintenance 
Loyalty-maintenance 


Purpose norms 
Communication norms 
Admission norms 

Ideological norms 

Identity norms 
Reference-group selection norms 
Specialization norms 
Segmentation norms 
Jurisdictional norms 
Role-assignment norms 
Status-assignment norms 
Leadership norms 
Procedural norms 

Mutuality norms 

Resource allocation norms 
Accommodation norms 
Hypothecating norms 
Image-maintenance norms 
Maturity-maintenance norms 
Objectivity-maintenance norms 
Morale norms 


(Zentner, 1973, p. 128. Reprinted by permission) 


Part of the difficulty in examining the relationship of school music 
administration and instruction has resulted from the limitations of the research 
methodologies typically used. These have been largely based on structural 
functionalist models (Zentner, 1973). The characteristics of structural functionalist 
models include: (a) a closed system; (b) a system applicable only to a particular time 
and space and to a given integrative level of analysis, be it physical, psychological, 
institutional, societal or historical (Taylor, 1975); and (c) a static system, i.e., one 
which does not respond to changes through time and space. For example, specific 
supervisor tasks identified in the context of one jurisdiction are liable to change from 
time to time and may become obsolete or may be inappropriate to another 
jurisdiction or within another organizational structure. Further, difficulties of 
comparison arise from the use of.a variety of supervisor task taxonomies in research. 
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Alternatively, processual models, i.e., open systems applicable to all integrative 
levels of analysis or levels of causation and responsive to changes through time and 
space, may constitute a useful methodological basis for research. A variety of such 
models exploring the operation of processes in organizations are extant, including 
Etzioni (1964), Loomis and Loomis (1965), Bakke (1967), and Zentner (1973), among 
others. 


Of these approaches, Zentner’s model holds considerable promise for research in 
music administration. Professor Zentner has described 22 group processes with their 
associated “norm-sets” which are applicable to all social organizations (see Table 6). 

The analysis can be applied at a variety of integrative levels of analysis and 
especially at the institutional level. The processes need systematic and theoretical 
description and analysis within the context of school music administration and 
subsequent rigorous testing over the short as well as the long run. Zentner’s model 
enables an investigation of questions which are problematic when investigated via 
structural functionalist models. Further, it provides a useful framework for future 
research in school music administration, in particular, its relationship to the 
instructional process. 
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RONALD D. OWSTON 


University of New Brunswick 


Systematic Computational Errors and 
Achievement in Elementary Mathematics 


The relationship between systematic computational errors and achievement is 
examined using a sample of native Indian children. A significant relationship 
was found to exist between error type and achievement in multiplication, with 
high achievers tending to make nonsystematic errors and low achievers tending 
to make systematic errors. No significant relationships were found between 
achievement and error type for addition or subtraction. 


There is a general agreement that computational errors occur at all levels of 
achievement in elementary mathematics (Ashlock, 1976), yet no research appears to 
have been done which relates the kind of computational error pupils make to the level 
of achievement. In this study the relationship between systematic computational 
errors and achievement in addition, subtraction, and multiplication is investigated. 

A distinction must be made between errors due to incomplete knowledge of basic 
facts and errors due to faulty computational procedures. For example, an error such as: 
52 
aes 


76 


is an error in basic facts because the child, for a variety of possible reasons, failed to 
recall the correct sum of 2 and 3. On the other hand, an error such as: 
42 
x21 


82 


is acomputational error because the algorithm the child used to find the product was to 
multiply the one’s digits and the ten’s digits of the multiplicand and the multiplier. 
Clearly, the remedial procedures used for basic fact errors are different from those used 
for computational errors. Basic fact errors are readily identified and corrected, 
whereas Cox (1975b) found computational errors persistent and difficult to identify at 
times. 


Ronald D. Owston is Associate Professor in the Curriculum and Instruction Division of the Faculty of Education, University 
of New Brunswick and is coordinator of a B.Ed. program for native Indian students. He received his PhD in measurement and 
experimentation from the University of Ottawa. 


114 


Systematic Computational Errors 


Various methods of classifying computational errors have been proposed. 
Brueckner (1930) was one of the first to develop a classification scheme, although much 
of Brueckner’s work is of questionable value today (Ashlock, 1976). More recently, 
Roberts (1968) used four categories of errors or “failure categories”: wrong operation, 
obvious computational errors, defective algorithm, and random response. Only two 
categories, careless and conceptual, were used by West (1971). West’s classification 
system distinguishes between whether or not errors are consistently made. That is, 
careless errors are not consistent, but conceptual ones are. Once an error has been 
classified as consistent, the teacher must then establish if it is due to basic facts or 
inability to apply a correct algorithm. 


Cox (1975a), like West, emphasizes the consistency of errors in another 
classification system. A distinction is made between systematic errors, random errors, 
and careless errors. Systematic errors have a discernible pattern and must occur in at 
least three out of five problems testing a specific algorithm; random errors must occur 
as often but have no discernible pattern; and careless errors are those occurring fewer 
than three times. The rationale for requiring five problems to test skill possession was 
based on concern for reliable diagnoses of systematic errors. 


Method 


Subjects 

The subjects consisted of 60 grade 5 Micmac and Maliseet Indian children living on 
three reserves in New Brunswick. This group, except for several students who were 
absent during the testing, represented the grade 5 population on reserves who attended 
schools operated by the Federal Department of Indian Affairs. The reserves are in 
rural areas of the province, yet all are within 100 miles of a major urban center. Pupils 
in these schools follow essentially the same curriculum as their counterparts in 
provincial schools. The sample was considered particularly appropriate for the study, 
as teacher reports and standardized test results (e.g., Cooper, Elliott & Owston, 1977) 
have indicated that these children tend to score below provincial norms with the deficit 
increasing as they progress through school. 


Instrumentation 

Subjects were administered the Algorithm Assessment Test (AAT), a 200-item 
diagnostic test used by the New Brunswick Department of Education. The test 
measures three basic computational skill areas: addition, subtraction, and 
multiplication. Eleven different skill levels ranging in difficulty from finding the sum 
of a two-digit and a one-digit number without regrouping, to finding the sum of two 
three-digit numbers with multiple regrouping and zero involvement. There are 14 
subtraction skill levels ranging in difficulty from finding the difference between a 
two-digit and a one-digit number without regrouping, to finding the difference 
between a four-digit and a three-digit number with multiple regrouping. The 15 
multiplication skill levels range in difficulty from finding the product of a one-digit and 
a two-digit number to finding the product of a two-digit and a three-digit number with 
multiple regrouping. Each skill level is measured with five test items. Items are 
presented in the vertical form and subjects are directed to show all work required to 
calculate their answers. 

Because of the length of the test, it is divided into ten subtests, each of which 
contains at least five addition, five subtraction, and five multiplication items. The 
subtests were administered on consecutive days by the homeroom teachers in their 
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regular classrooms. There was no time limit, although 30 minutes proved sufficient in 
most cases. Also, since the emphasis of the test was on performing the algorithms 
successfully, subjects were told they could ask their teachers questions regarding the 
basic facts such as “how much is 2 x 2?”; however, questions about the algorithms such 
as “dolhave to borrow?” could not be asked. Despite these directions, teachers 
reported that no more questions were asked than was usual for other class tests. 


Procedure 


After the tests were administered, the answers were scored correct or incorrect and 
the total score for each of the three skill areas was found for each subject. Then with the 
help of several assistants, all incorrect answers for each skill level were examined 
together to discern an error pattern. If an error pattern was evident in at least three of 
the five answers, the error was classified as systematic, following the suggestion of Cox 
(1975a). The error was classified as nonsystematic if at least three of the five answers 
were incorrect but no pattern was found. Cases where a scorer was uncertain about the 
classification were referred to other members of the scoring team for discussion and 
eventual resolution. Errors which occurred fewer than three out of five times were 
categorized as “other” and were included in the study only to give an indication of the 
pervasiveness of systematic errors. 


When the error classifications were completed for addition, the papers were 
divided into two groups, high achievers and low achievers, according to the total 
addition score. High achievers were defined as those scoring above the median score, 
while low achievers were defined as those scoring at or below the median score. The 
number of systematic and nonsystematic error classifications made in each group was 
then tabulated. The same procedure was repeated for subtraction and multiplication. 


Results 


The numbers of errors made by all subjects for addition, subtraction, and 
multiplication are given in Table 1. From this table it can be seen that the greatest 
proportion of systematic errors occurred with subtraction (38.5% of the total), 
followed by multiplication (27.7 % ) and then by addition (12.4%). 


TABLE 1 


NUMBERS OF ERRORS MADE BY ALL SUBJECTS 
FOR ADDITION, SUBTRACTION, AND MULTIPLICATION 


Error Type Total 
Jperation —— 

Systematic Nonsystemati Other umoer 

; of Errors 

ogc a 32 (12.4) 11 ( 4.2 216 (83.4) 259 
suotraction 140 (38.5 20 565 204 (56.0) 364 
f Dp 3 92 r es = > = ~~ > 
ea hae cee te) a Se 6 15.6 221 (50./) 390 
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Presented in Tables 2, 3, and 4 are the numbers of systematic and nonsystematic 
errors made by high and low achievers for addition, subtraction, and multiplication 
respectively. In order to test the hypothesis of a relationship between error type and 
pecker a chi-squared test (with Yates correction factor) was applied to these 
tables. 


TABLE 2 


NUMBER OF SYSTEMATIC AND NONSYSTEMATIC ERRORS 
MADE BY HIGH AND LOW ACHIEVERS IN ADDITION 


— LL Eee 


Error Type _ Achievement _ Total Number 
Low High of Errors 
eS ee AES ee ee) dee Ve Ce 
Systematic 31 i 32 
NonsyStematic 10 1 itl 


ee ee eee 


TABLE 3 


NUMBER OF SYSTEMATIC AND NONSYSTEMATIC ERRORS 
MADE BY HIGH AND LOW ACHIEVERS IN SUBTRACTION 


: - Achievement Pot nine 
rror Type SSS £ 
ae High of Errors 
Systematic 25 15 140 
Nonsystematic 16 4 20 
TABLE 4 


NUMBER OF SYSTEMATIC AND NONSYSTEMATIC ERRORS 
MADE BY HIGH AND LOW ACHIEVERS IN MULTIPLICATION 


Achievement : Te Deer 
ae OB A; ot Exrnors 
Low High 
Systematic LOS 3 108 
Nonsystematic 52 9 61 
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A significant relationship was found between error type and achievement in 
multiplication (x?=5.25, p<.05). High achieving individuals tended to make more 
nonsystematic errors than systematic errors. On the other hand, more systematic 
errors than nonsystematic errors were made by low achievers. 

No significant relationships were found between addition achievement and error 
type (x?=.0003, p>.05) and subtraction achievement and error type (x’=.691, 
p>.05). Both high and low achievers tended to make a proportionally greater 
number of systematic errors than nonsystematic errors in addition and subtraction. 


TABLE 5 


EXAMPLES OF COMMON SYSTEMATIC ERRORS 
IN ADDITION 


Skill eae Description of Error 
of JEErOrE 
Adding a one-digit and 27 Units digits correctly added; 

a two-digit number; naw tens digit in upper addend added 
no regrouping required. 49 to units digit in lower addend. 
Adding a one-digit and M Units digits correctly added with 
a two-digit number; 43 correct regrouping; tens digit 
regrouping required. sure 8) in upper addend added to units 

142 digit in lower addend. 
Adding two three-digit al Units digits correctly added 
numbers; regrouping required 689 with correct regrouping; 
in units, tens, and hundreds +463 no regrouping performed in 
places. 10152 hundreds place. 


Due to the relatively few systematic errors made by high achievers for each of the 
three operations, meaningful comparisons could not be made between the kinds of 
systematic errors made by high and low achievers. Examples of the most common kind 
of errors made by the entire group are given in Tables5,6,and7 for addition, 
subtraction, and multiplication respectively. 


118 


Systematic Computational Errors 


TABLE 6 


EXAMPLES OF COMMON SYSTEMATIC ERRORS 
IN SUBTRACTION 


Skill Examples 


oh Reeone Description of Error 


Subtracting a one-, two-, or 31 526 634 The smallest number is sub- 
three-digit number from [uO 445.6 tracted from the largest 
another one-, two-, or BOD 2. number in each column. 


three-digit number; 
regrouping required. 


Subtracting a two- or three- 70 57 200 Whenever zero is in the same 
digit number from another =i On = 14 column as another number, zero 
two- or three-digit number 37 10 144 or the other number is given 
with a zero in minuend or as the difference. 
subtrahend; regrouping 
required. 

Subtracting a two-digit 89 The common multiplication 
number from another two- -47 algorithm is applied to a 
digit number; with or 2. subtraction problem. 
without regrouping. 45 

462 
TABLE 7 


EXAMPLES OF COMMON SYSTEMATIC ERRORS 
IN MULTIPLICATION 


Skill teats Description of Error 
OL ELror 
Multiplying a two-digit 35 Units digits multiplied without 
number by a one-digit number; x5 regrouping and tens digit of 
regrouping required. ak multiplicand placed in the 
325 hundreds position in the product. 
Multiplying a two-digit il Units digits multiplied 
number by a two-digit number; 35 with regrouping in tens 
regrouping required. x53, place of multiplicand; 
165 tens digits multiplied. 
Multiplying a three-digit DSS) Algorithm performed correctly 
number by a two-digit number. x45 except the lower partial 
1265 product is not shifted 
HOM? one place to the left. 
WTA 
Discussion 


Although no other studies have examined systematic errors and achievement, the 
study of Roberts (1968) provides a context in which the results of this study may be 
viewed. The findings of this study support his findings to the extent that errors 
involving incorrect application of an algorithm occur at all achievement levels, 
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including the fourth quartile. Roberts found that in the lowest quartile, random errors 
exceed algorithmic errors, however. This apparent contradiction might be explained 
by the fact that Roberts did not break down his analysis by operation, so possibly the 
division operation, which was not included in this study, biased the data. Another 
explanation could be that since Roberts did not use more than one test item to measure 
each skill, errors which appeared to be random would have been diagnosed as 
systematic with more adequate testing. If neither of these explanations prove correct, 
the contradictory findings could be explained by differences in which both groups of 
subjects were taught mathematics. 


The reason why achievement was found to be related to error type for 
multiplication yet not for addition and subtraction is less clear. The relatively small 
number of errors made by the high groups (2, 19, and 11 for addition, subtraction, and 
multiplication, respectively) may not be representative of the population and, hence, 
the conclusions should be regarded as tentative pending replication on a larger sample. 
Given this limitation, then, perhaps nonsystematic errors exceeded systematic errors 
for high achievers but not for low achievers because more sophisticated faulty 
algorithmic errors were made by high achievers. Some of these errors may have gone 
undetected by the scorers. If individual diagnosis had been possible whereby scorers 
could have asked questions regarding the subjects’ solutions, more nonsystematic 
errors might have been classified as systematic for high achievers. Whether or not this 
would have resulted in the number of systematic errors exceeding nonsystematic errors 
is open to speculation. 


Despite the tentative relationship established between multiplication 
achievement and error type, this study does suggest that systematic computational 
errors contribute substantially to poor achievement. Furthermore, the error 
patterns made by the present subjects do not appear to be different from the kinds 
described by Ashlock (1976) and others. Teachers must, therefore, make an effort 
to identify the kinds of systematic errors their pupils are making and prescribe 
appropriate remedial procedures, and to avoid simply labelling algorithmic errors as 
“stupid” or “careless” and telling their pupils to “try harder next time.” Beyond this, 
teachers should examine their teaching techniques with a view towards preventing 
systematic errors from occurring. 
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A Comparative Analysis of Vocabulary Load 
of Three Provincially Adopted 
Primary Arithmetic Series 


Results of a computer-assisted program to investigate the degree of variability, 
overlapping, and vocabulary load found between three current provincially 
adopted primary arithmetic series are presented. A Basic Comprehensive 
Primary Vocabulary List of 565 words was compiled, derived from vocabulary 
found to be common to two or more of the series studied. Findings indicate that: 
(a) the total number of running words differed considerably between series; (b) 
there is a marked increase in different words at both the second and third grade 
levels in each of the series; (c) the majority of different words in any one series 
were not found to be repeated more than one to nine times; and (d) not a large 
number of different words were found to be in common between series or grades 
in any given series. The results of these findings are further discussed in terms of 
their implications for primary teachers, textbook selection committees, and 
authors of arithmetic textbooks. 


Although a large number of new and revised editions of primary grade arithmetic 
textbooks have found their way into the classroom over the past few decades, the 
problem of reading difficulty of these texts continues to pose a serious concern for 
teachers in terms of providing for the needs of primary pupil reading ability (Dallman, 
Rouch, Char & DeBoer, 1978, pp. 323-324). 
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In recent years, primary arithmetic textbooks have undergone considerable change 
in content, methodology, and in the introduction of new and expanded vocabulary 
loads and concepts. These changes can perhaps be best exemplified by the introduction 
of “New Math” (Spache & Spache, 1977, p. 301) and the more recent adoption of 
metricated mathematics textbooks now widely used in Canadian schools. 


Since each school in the province of British Columbia selects one or more of the 
prescribed series, schools in a single district may well be using different series for the 
same grade levels. Thus it is reasonable to assume that a child transferring from one 
school to another within a district or between districts will encounter a different 
prescribed series. 


Inasmuch as vocabulary plays an important role in the reading ease or difficulty of 
most primary grade textbooks (e.g., basal readers) and because teachers rely mainly 
upon textbooks for instructional purposes, there appears to be a real need at this time 
to provide teachers with more current information on the degree of variability, 
overlapping, and vocabulary load found in most primary arithmetic textbooks. 


Background 


Earlier research studies including those of Repp (1960), Kerfoot (1961), Stauffer 
(1966), Smith and Heddens (1964), Reed (1965), Browning (1970), and Willmon (1971, 
1975) have previously explored vocabulary in primary arithmetic. However, the 
purpose of this study was to examine the vocabulary of more recent arithmetic series 
commonly used throughout British Columbia in order to develop an updated basic 
comprehensive primary vocabulary list (BCPVL) which would better reflect modern 
and metricated mathematics vocabulary found in current arithmetic textbooks. 


Problem 


The essential purpose of this study was to compare the vocabularies of three 
prescribed primary arithmetic series for grades 1, 2 and 3 in the province of British 
Columbia with respect to the following: 

1. What is the total number of running words for the combined texts in grades 1, 2 
and 3 for each of the three series? 

2. | Whatis the total number of running words in each text for grades 1, 2 and 3 in each 
of the three series? 

3. What is the total number of different words in each text for grades 1, 2 and 3 in 
each of the three series? 

4. What is the total number of different words for the combined texts in grades 1, 2 
and 3 for each of the three series? 

5. What is the frequency of repetition for each word contained in separate texts for 
grades 1, 2 and 3 in each of the three series? 

6. Whatis the total number of words in common by grade between series? 

7. What is the total number of words in common between any of the three series? 

In other words, in primary grades consistency of vocabulary across the grades and 
between series is of crucial importance for instruction. Repetition of words is also a 
significant variable to consider, especially at the primary level. 


Definition of Terms 
For purposes of this investigation, the following meanings were associated with 
these terms: 


Running Words. Every printed word appearing in: (a) any text for grades 1, 2 and 3, 
or (b) any one of the three series. 
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Different Words. Running words which are identified exactly once, regardless of 
frequency of occurrence in: (a) any text for grades 1, 2 and 3, or (b) any one of the three 
series. 

Words in Common. Words which appear at least once in: (a) any two textbooks of 
the same grade, irrespective of series, or (b) any two or all three of the series. 


Unique Words. Words which are not in common in: (a) any two textbooks of the 
same grade, irrespective of series, or (b) any two or all three of the series. 


Method 


The three arithmetic series selected for this study were those currently prescribed 
for British Columbia schools. They are widely used in many provinces of Canada and in 
some parts of the United States. 


The texts included in this investigation were limited to those of grades one through 

three, totaling nine books in all. Included were the following: 

1. Heath Elementary Mathematics Canadian Metric Edition. Books 1, 2, 3. 
Toronto: D.C. Heath Canada Ltd., 1975. 

2. Investigating School Mathematics. Books 1, 2, 3. Don Mills, Ontario: 
Addison-Wesley (Canada) Ltd., 1973. 

3. Project Mathematics SI Metric. Books 1, 2, 3. Toronto, Montreal: Holt, 
Rinehart and Winston of Canada Ltd., 1975. 


The total vocabulary of the three series was keypunched onto computer cards 
which were coded by grade and series. These cards were then entered and stored in 
computer files. A computer program was developed and applied to the running words 
to determine: (a) the total number of running words in each text and in each series, (b) 
the total number of different words in each text and in each series, and (c) the 
frequency of repetition of each different word in each text and in each series. A second 
computer program was then initiated to determine: (a) the number of words in common 
by grade between series, and (b) the number of words in common between any of the 
three series. 

Tabulations of data were based on the total number of running words which 
included: (a) proper nouns, (b) words containing inflectional endings, (c) 
abbreviations, and (d) variants, found in each of the three series. 

The criteria used to determine the Basic Comprehensive Primary Vocabulary List 
(BCPVL) were predicated upon the identification of words which were common to two 
or more of the series and not upon frequency of occurrence (Harris & Jacobson, 1972, p. 
34; 1973). However, in order to maintain consistency with most compilers of word lists, 
it was necessary to merge inflected word forms with root words (e.g., products, 
product); proper nouns, abbreviations and different root word endings were deleted 
(Harris & Jacobson, 1972, p. 27; 1973). Words were assigned to grade level according to 
their first appearance in a particular text, a procedure also used by Stauffer (1966, p. 
142). For example, if the word amount first appeared in a second grade text and later in 
a third grade text, it was identified as a second grade word. 
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Results 


There appears to be a considerable variation in the total number of running words 
between series. An examination of Table 1 shows that the distribution of these running 
words by series was: (a) Heath (DCH) 32 745; (b) Holt, Rinehart Winston (HRW) 
40 149, and (c) Addison-Wesley (AW) 45 080. Although few differences were found in 
vocabulary load between the first and second grade texts, there was a considerable 
increase in the number of running words between the second and third grade texts as 
well as the first and third grade texts. These differences are shown in Table 1, which 
reveals that the increase in the number of running words between the second and 
third grade texts for the DCH series and the HRW series is 2.6 and 2.7 times, 
respectively, while AW has a greater increase of running words between these same 
grades, 4.3 times. 


TABLE 1 


NUMBER OF RUNNING WORDS BY SERIES AND GRADE 


Grade DCH HRW AW 
il (By jfk: Sh ALY 3 803 
2 7, 856 9 724 th WBE 
3 AO) SUS) 26 708 Bis) 54s} 
4-2 3 Bye NS) 40 149 45 080 


Table 1 also illustrates an increase in the number of running words between first 
and third grade texts, in that DCH’s total number of running words is increased 4.4 
times, HRW’s increase is 7.2 times, and AW is increased 8.8 times. 

These results indicate that of the three series, a student using the DCH texts would 
have the least number of running words to contend with for grades one to three, 
whereas a student using the AW texts would be confronted with the greatest number of 
running words for these same grades. 

A further examination of data presented in Table 1 shows that the total number of 
running words in each of the Grade 1 texts, for each of the series, does not differ 
appreciably. However, at the grade 2 level, the HRW text has approximately 2 000 more 
running words than the DCH or AW texts. There appears to be very little difference in 
the number of running words in the grade 3 texts between DCH and HRW (6 393 
words) and between AW and HRW (6 835 words). However, the greatest difference in 
the number of running words is found between DCH and AW (13 228 words) — a 
sizeable difference. 
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TABLE 2 


NUMBER OF DIFFERENT WORDS BY SERIES AND GRADE 


Grade DCH HRW AW 
aa pn eee eee 
i! Bie 424 364 
2 762 763 917 
3 stent 1 540 2 075 
1 to 3 @ 1 595 1 785 Pe aN ey5) 


@ These totals exclude the number of different words overlapping between 
grades 1, 2, and 3 texts from each of the given series. 


An examination of Table 2 reveals the number of different words found in each text 
for grades 1, 2 and 3 from each of the three series. The total number of different words 
contained in the DCH series is 1 595; the HRW series, 1 785; and the AW series, 2 365. 
These data indicate that the AW series has an appreciably larger number of different 
words than the other two series, 33 percent more than DCH and 25 percent more 
different words than HRW. Further, the HRW series contains 11 percent more 
different words than the DCH series. 

The data also reveal that the increase in the number of different words between the 
consecutive graded texts is approximately double for both the DCH and HRW series. 
Moreover, the increase in the number of different words in the AW series is even 
greater (approximately 2.4 times). 

A further examination of Table 2 reveals some interesting results with respect to 
the number of different words contained in texts at the same grade level. For example, 
in comparing the grade 1 texts from each of the three series, HRW has 12 percent and 
14 percent more different words than DCH and AW, 52 and 60 words respectively. For 
the beginning reader this difference might well present a problem, depending upon the 
degree of complexity and familiarity of these words. 

A comparison of grade 2 texts shows that AW has 17 percent more different words 
than the DCH or HRW texts. Further, at the grade 3 level AW continues to show an 
increase in different words of 33 and 26 percent more than in the DCH and HRW texts 
respectively, a significant increase. 

In view of the findings from Table 2, one might conjecture that a student who begins 
with the DCH or HRW series and later transfers into the AW program at the second or 
third grade levels, would have to contend with a sharp increase in the number of 
different words to be learned. 

In addition to the variation in the number of different words between series and 
across grades, there is also a considerable number of new words within a given series. 
This is exemplified in the DCH series, wherein the total number of different words in 
common between grades one and two, and two and three, is 290 and 597 respectively. 
This means that 61.9 percent of all the second grade words, and similarly, 57.1 percent 
of all the third grade words were new words. These data suggest a possible problem in 
vocabulary usage in primary arithmetic series, in that a significant number of new 
words are introduced in each of the successive texts within a given series. 
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Series by 


TABLE 3 


FREQUENCY OF WORD REPETITION 
BY GRADE WITHIN A SERIES 


Total Number 


Word Repetition Intervals 


Grade Different Worgs yl 0iel soummm ls 100 26-50 10-25 1-9 
DCH-1 372 1.9% 3.8% 4.0% 14.8% 15.5% 
HRW-1 424 Hest 1.4% 9714 ag h2s 37 83.3% 
AW-1 364 0.5% 3.0% jan, ely 76.6% 
DCH-2 762 1.4% Lapa ASS 7. eens 80.2% 
HRW-2 763 ee 2.8% Seay men bect 76.7% 
AW-2 917 0.9% 2% 4 BEL LOWS 82.0% 
DCH-3 1 391 2257 3.6% 6.07 sips 74.8% 
HRW-3 1 540 2.9% Gy 5.8% 14.0% 73.6% 
AW-3 paddy 2.6% 3.2% ei wie bleed 76.7% 


The data contained in Table 3 reveal that a scant number of words were repeated. 
The frequency interval (1 to 9) shows that a high percentage of words are repeated 
fewer than ten times. For example, in Heath Book I (DCH-1) which contains a total of 
372 different words, 281 or 75.5 percent of these words are repeated fewer than ten 
times. This is an approximate ratio of 3:1 which means that for every four words a child 
encounters in this text, only one of four would be repeated ten or more times. Six of the 
nine textbooks have this approximate 3:1 ratio. The exceptions had even larger 
approximate ratios — HRW-1 and AW-2 had a ratio of 5:1, while HRW-2 had a ratio of 
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NUMBER OF COMMON AND UNIQUE WORDS 


TABLE 4 


BY GRADE BETWEEN SERIES 


Words Common to 
DCH, HRW, AW 
DCH, HRW 
DCH, AW 
HRW, AW 


Words Unique to 
DCH 


HRW 
AW 


Grade 1 


Number 


WZ 
183 
2 
166 


138 
196 
147 


Wc 
24. 
Zone 
21. 


18. 
25. 
19. 


Note. The percentages are based 
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N 


Grade 2 
umber i 
Behl Pats! 
412 270 
413 Died 
364 2359 
208 alisha s; 
258 16.9 
411 PTA) 


Grade 3 


Number 


695 
800 
941 
962 


345 
473 
867 


on totals derived from Table 2. 


BSS 
26. 
She 
Sy 26 


Fe aS 


TSS) 
15. 
18. 
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The data included in Table 4 show that the words found in common between any 
two series at the first grade level did not exceed 24 percent. Similarly, there was only a 
slight increase in the percentage of words in common at the second and third grade 
levels, namely 27 percent and 32 percent respectively. 


In viewing the combined series by grade, the percentage of overlap is even lower, 
15.9 percent at the first grade, 17.8 percent at the second grade and 23.2 percent at the 
third grade levels. These data appear to indicate that there is a minimal vocabulary 
overlap by grade between series. Further, words which were not found to be in common, 
that is, the remaining or unique words, constituted high percentages including: (a) 63.3 
percent of the first grade vocabulary of different words, (b) 57.5 percent of the second 
grade words, and (c) 56.2 percent of the third grade words. These findings are similar to 
those of Stauffer’s study of over a decade ago (Stauffer, 1966, p. 146). 


TABLE 5 


NUMBER OF COMMON AND UNIQUE WORDS BETWEEN SERIES 


Series 
Number 
2DCH + HRW + AW DCH + HRW DCH + AW HRW + AW DCH HRW AW 
Common Words 848 987 1 106 i L39 
Unique Words 350 507 968 
Percent Dee. 29.4 329 B3n0 10.4 sya! 28.8 


2 Total number of different words = 3 361 


Table 5 shows that there does not appear to be an appreciable difference in the 
number of words in common between any two series, or in the combined series. This is 
illustrated in the narrow percentage range of 29.4 to 33.9 for any two series, and 25.2 
percent for the combined series. 

There does, however, appear to be a considerable difference in the percentage of 
unique words found between series. The disparity between either the DCH or HRW 
series and the AW series is indeed great, 18.4 percent and 13.7 percent, respectively. 
Further, of the 3 361 different words of the combined three series, 1 536 or 45.7 
percent were words common to two series, while 1 825 or 54.3 percent were unique words. 

The above findings can be summarized as follows: 

1. The total number of running words differ considerably between series. The DCH 
series has the least number of running words, while the AW series has the greatest 
number. 

2. There was considerable increase in the number of running words found between 
the second and third grade texts, in each of the series. 

3. The greatest disparity in the number of running words between any of the grade 
texts was found between DCH and AW. 

4. The total number of running words does not appear to be a useful criterion in 
predicting the number of different words in these series. 
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5. The AW series contains a significantly larger number of different words than the 
other two series. 

6. The AW third grade text contains a significantly higher number of different 
words than the third grade texts of either DCH or HRW. 

7. Approximately 75 percent of the different words in any one series were not found 
to be repeated more than one to nine times. 

8. Notalarge number of different words were found to be common between series or 
grades in any specific series. 


9. A list of 565 words were found to be common to two or more of the series (see 
Appendix). 


Discussion and Implications 


The results of this study indicate that the total number of running words and the 
total number of different words are essentially independent. In light of this finding, it is 
suggested that teachers involved in selecting primary prescribed arithmetic series for 
classroom or district use exercise caution in making assumptions with regard to the 
nature of vocabulary based solely on the number of running words as a criterion. It 
might also be suggested that provincial textbook selection committees take into 
consideration criteria which yield information concerning the degree of variability, 
overlapping, and vocabulary load inherent in primary arithmetic series. This 
information might, in turn, be disseminated to teachers to assist them in their 
classroom or district selection and in their classroom use. 


Whereas in basal readers, vocabulary is generally carefully controlled and words are 
deliberately repeated many times, the data contained in this study revealed that at 
least 75 percent of the different words were repeated fewer than ten times in each of the 
texts. Teachers should therefore be alerted to the fact that this lack of repetition might 
present a potential problem for the unskilled and beginning reader. 


Finally, in view of these findings, it might be suggested that primary teachers, 
curriculum consultants, arithmetic textbook selection committees, and authors of 
arithmetic series give careful consideration to the basic vocabulary demands implicit in 
present day arithmetic series. 
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APPENDIX 


Basic Comprehensive Primary Vocabulary List 


The Basic Comprehensive Primary Vocabulary List (BCPVL) was compiled with the 
following purposes in mind: 

1. To reflect the vocabulary contained in current modern and metricated arithmetic series 
(197391975). 

2. To develop a comprehensive arithmetic word list representative of all the words found 
to be common in two or more given arithmetic series. 

The alphabetized BCPVL contained herein, consists of a total of 565 graded words 
including: (a) 164 first grade words, (b) 133 second grade words, and (c) 268 third grade 
words. 

The list is comprised of words that are common to two or more of all the series. The 
majority (89.7 percent) are common to all three series; only a small percentage (10.3 percent) 
are common to exactly two of the series. The latter word set are identified by a single asterisk. 
Grade levels are assigned according to the first appearance of a word in a particular textbook; 
thus, no words are repeated at successive grades in the final list. 
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Basic Comprehensive Primary Vocabulary List 


Note. Words identified by an asterisk were common to exactly two series, those 


without an asterisk were common to all three series. 


Grade One 
a about add addend addition 
all *an and are *arm 
as *ball bar *bead block 
blue *book *boy *calendar can 
*card cent centimetre circle *classroom 
clip color *compare count *crayon 
*desk dime *does *dog draw 
each eight eighteen *eighth *eighty 
eleven equal eraser *face fifteen 
*fifth fifty fill finish *first 
five *foot for forty four 
fourteen fourth from *girl has 
have *height hour how *hundred 
li in *introduction is large 
length less *like line *little 
long make many *mass match 
measure *minus missing more *my 
*need nine nineteen *ninety *ninth 
number o'clock of on one 
or order out paper paste 
pattern *pencil penny picture *plus 
*project put *recognize red *review 
ruler *school *second set *sequence 
seven seventeen *seventh *seventy shape 
*short six sixteen sixty *skip 
small *solid some *span square 
*stick *story *strip subtract table 
tell ten *tenth than that 
the then these third thirteen 
thirty this three time to 
twelve twenty two use way 
we what which *wide will 
with you your zero 
Grade Two 
am amount another answer any 
apple around at ate away 
back be big bird black 
body both bowl box brown 
*bubble build by candy car 
cat change children class coin 
complete container cookie correct cost 
could *cover —— cut day 
did difference different do dot 
eat enough far find fish 
fraction get go got grade 
graph *oreen group *gum had 
half he help here *high 
if 7 keep kilogram know 
left litre look made marble 
me metre milk minute money 
much multiplication eS “nickel no 
not now numeral off only 
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orange 
problem 
row 
show 
spent 
their 
together 
walk 
*word 


Grade Three 
above 
airplane 
animal 
bag 
begin 
birthday 
bus 
carefully 
chart 
code 
copy 
design 
dollar 
drive 
empty 
example 
father 
fold 
game 
gram 
head 
hold 

it 
kilometre 
least 
loop 
magic 
might 
multiple 
next 
once 
package 
part 
pick 
plane 
price 
reach 
roll 

say 
several 
side 
solve 
stamp 
string 
symbol 
thick 
through 
trip 
under 


other 
read 
same 
small 
start 
them 
took 
want 
write 


across 
along 
apart 
baseball 
below 
bottle 
but 
carton 
check 
collect 
corner 
distance 
done 
during 
end 
exercise 
favorite 
follow 
gave 
great 
her 
home 
jar 
knife 
leg 

lost 
mark 
model 
must 
notebook 
open 
page 
path 
pie 
play 
product 
really 
round 
scale 
shade 
sigh 
special 
stand 
study 
tape 
thin 
tile 
true 
unit 


over 
rectangle 
see 

sO 
symmetry 
there 
trace 
water 

ZOO 


after 
also 
area 
basic 
best 
bought 
buy 
centre 
child 
column 
cross 
divide 
door 
easy 
estimate 
fact 
few 
found 
geometry 
guess 
hill 
information 
join 
last 
letter 
lot 
math 
month 
near 
object 
outside 
paid 
pay 
piece 
pop 
puzzle 
record 
rule 
score 
sheet 
sister 
speed 
star 
sum 
teacher 
thing 
top 

try 
until 


Vocabulary Load of Three Arithmetic Series 


people 
*remember 
she 
*sort 
take 
they 
triangle 
were 


again 
always 
arithmetic 
been 
between 
brother 
call 
chalkboard 
chocolate 
come 
cube 
division 
down 
edge 
even 
factor 
figure 
fruit 
give 
hand 
him 
inside 
jump 
late 
library 
lunch 
mean 
most 
new 

odd 

own 
paint 
per 

pile 
possible 
quarter 
remainder 
run 

sell 

shirt 

size 
spend 
still 

sure 
telephone 
those 
total 

turn 

us 


point 
room 
shoe 
space 
tall 
think 
up 
when 


ago 
among 
arrow 
before 
bicycle 
bundle 
came 
chapter 
close 
continue 
cup 

doll 
dozen 
egg 
exact 
family 
fit 

gain 
glass 
happy 
his 

into 
just 
learn 
list 
machine 
member 
mother 
newspaper 
old 
pack 
pair 
person 
place 
practice 
question 
right 
save 
sentence 
should 
sold 
sphere 
store 
swim 
temperature 
thousand 
travel 
twice 
value 
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very volume was watch week 
weigh well went wheel where 
whether white who why width 
window wish without work would 
wrong year yellow 
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Judging Global Readability 


Estimating readability, the difficulty of reading materials, is an important 
teacher function. This study addressed the problem of judging global readability 
through the matching of passages with unknown difficulty to criterion sets of 
passages with known readability. Forty-two teachers with 2-20 years of 
experience rated a set of passages by two global readability techniques—the 
SEER and the Rauding procedures. 

The findings indicated that the two procedures resulted in significantly 
different ratings, that the amount of teaching experience did not affect the 
ratings, and that the SEER ratings agreed substantially with readability formula 
estimates of the same passages. 


Estimating the difficulty of reading materials is an important teacher function, 
since it is the primary method used for matching readers and materials. Readability 
procedures have varied from detailed (Dale-Chall, 1948; Spache, 1953; Bormuth, 
1966) to simple (Fry, 1968; McLaughlin, 1969; Raygor, 1977; Baldwin & Kaufman, 
1979) to linguistic (Granowsky & Botel, 1974) to global (Carver, 1975-76; Singer, 
1975), and each procedure has claimed either reliability, validity, or calculation ease 
as an advantage. Depending on the purpose of the readability rating and the number 
of raters available, the global method may be a viable one. 


Global readability estimates involve the matching of materials with unknown 
readability to a criterion set of passages with known readability (but which typically 
have been calculated by one or more formulas). After selecting the closest passage on 
the criterion set, the same readability figure is assigned to the passage with the 
unknown difficulty. This matching technique has been used in the rating of 
handwriting samples for many decades. Global ratings also have the possibility of 
taking into account ideational difficulty which the formulas cannot. However, global 
ratings do depend heavily on the reliability of the rater. 


Dr. Froese teaches reading and language arts courses at the University of Manitoba. His research interests are in the 
area of basic and applied language processes. He has published widely in journals and has co-authored two university 


level textbooks. 
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Because of the validity and the calculation ease reported by two global 
readability procedures, the Rauding Scale (Carver, 1976) and the SEER technique 
(Singer, 1975), this study sought to examine the utility of these two procedures for 
the classroom teacher. More specifically, the questions asked by this study were: 

1. Are the mean ratings using the Rauding and the SEER technique practically 
and statistically in reasonable agreement? 

2. Do teachers with varying amounts of teaching experience rate passages 
differently? 

3. How do Rauding and SEER ratings compare with ratings based on the Spache 
or Dale-Chall formula? 


Sample 

The 42 teachers enrolled in a fifth-year level university reading course who had 
completed both sets of ratings were included in the study. Their teaching experience 
ranged from two to twenty years and the following frequency distribution was used 
in the analysis: 


Experience in Years Frequency 

2 5 

3 7 

4 7 

5 4 

6 - 10 8 
11-15 7 

16 - 20 4 


A 2 1/2 - hour lecture on the topic of readability had preceded the rating 
sessions. The lecture included information about the development of readability 
formulas, factors of readability, accuracy of the formulas, sources of readability 
information and special areas of concern (i.e. content areas, examinations). In 
addition each teacher was required to calculate the readability of a textbook and a 
tradebook using the Spache formula for primary level materials and the Dale-Chall 
formula for the intermediate levels and above. Information about the Rauding and 
the SEER techniques was presented as part of the above mentioned lecture. 


Methodology 


The eight passages were taken from Singer’s (1975) “unknown paragraphs” but 
rearranged randomly in two ways (6, 8, 2, 5, 7, 3, 1, 4, and 7, 4, 6, 5, 1, 8, 3, 2) so as to 
make two forms. An answer sheet with blanks numbered one to eight was provided 
along with the passages and the criterion passages. 

Directions to teachers followed the instructions given by Singer and by Carver in 
their respective journal articles except that only one rating was possible for each 
passage rather than the three suggested by Carver. (The reader could take the 
means in Table 1 as better predictors if desired.) In short, teachers read the sample 
passages and compared them to the criterion passages as provided by Singer and 
Carver. 
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One week the teachers were asked to rate the passages by the SEER technique, 
the next week by the Rauding index. It was recognized that order effect could not be 
ruled out by this procedure; however, it was not considered to be an important 
factor. The readability ratings for the eight passages using the Spache or Dale-Chall 
formula were reported to be: 1, 4.8; 2, 6.4; 3, 2.1; 4, 4.5; 5, 5.9; 6, 3.0; 7, 1.8; 8, 3.8. An 
SOL factorial analysis of variance program which could handle unequal replications 


per cell was used to analyze the data. 


Findings 


Table 1 presents the mean rating for the SEER and the Rauding techniques. The 
ANOVA results in Table 2 indicate that except for passage #4 (Original paragraph 
beginning: Several times. . .) all the remaining passages had statistically significant 


different ratings by the two techniques. 


TABLE 1 


MEAN PASSAGE READABILITY RATINGS BY TEACHERS 


WITH VARYING AMOUNTS OF EXPERIENCE 


Experience in Years 


Passage Formula 
3 4 
1 S 4.44 4.00 everless 
R 40 Mou Dy 
2 S 34 SAY) D4 
R 1.60 1.86 Pa sl 
3 S P90 2 oA Sy 
R 4.75 4.57 Ze) 
4 5 4.08 55 oe 4, 19 
R 4.68 4.71 Dias 
5 S 4.30 4.43 5.41 
R 1380 2.00 5 UP 
6 S 2.06 2.4/ 72, JO) 
R 3.00 4.14 A. 
: S 2.64 Mies (EG Vary 
R 5.40 ey all 7.14 
S 4.48 SGP) 5.40 
R 4.00 a 29 4.86 


5 


+33 


a ts: 


-10 
-50 


68 
00 


-43 
Bea) 


-78 
aS) 


-48 
BEL) 


EN) 
-00 


Fo)s) 
BOES: 


Iu; 


Weighted 
Means 


4.20 
1.6 


on 
2.02 


ele 
4.80 


te 
4.77 


A 7D 
2.41 


The answer to the first question is therefore: No, the Rauding and the SEER 
technique do not result in similar readability ratings, neither in a statistical nor in a 


practical sense. 
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A rank order correlation of -0.548 was found between the SEER and Rauding 
ratings; 0.881 between the SEER and the Spache/Dale-Chall ratings; and -0.762 
between the Rauding and the Spache/Dale-Chall ratings when mean ratings for all 
eight passages were considered. This comparison is perhaps the fairer one to the 
Rauding procedure since it suggests using the average of three ratings as a means of 
establishing grade difficulty. 

The second question dealt with differences in ratings by teachers with varying 
amount of experience. Table 2 indicates that ANOVA results are not significant for 
any of the eight passages rated. The amount of teaching experience does not appear 
to affect the accuracy of the ratings in a statistically significant fashion. This 
conclusion is further supported by the fact that no significant interaction effects 
were found between the amount of teaching experience and the formula used. 


TABLE 2 


ANALYSIS OF VARIANCE RESULTS BY PASSAGE 


Passage 
| Z 3 4 5 6 i 8 
Formula iAP oss*) I T0n47**.  Ii2noo%*) 0.027 98.i50%) 939). 00st 24 58% 7 2 oo 
(df=1,70) 
Experience 0.98 ORS Shig ML Oot oS 1535.6 Ls 57 0.63 
(d£=6, 70) 
Interaction 0.84 bo Se 6.04 0.68 Do DY 0.24 balOS 0.39 
(df£=6, 70) 


* significant at .05 level 
* * significant at .01 level 


Do the global readability ratings compare favourably with the readability 
predictions from the Spache/Dale-Chall formulas? Table 1 lists means (the mean 
rating of all 42 subjects for each passage) and the Spache or Dale-Chall ratings were 
given earlier in the text. Even a visual inspection reveals a reasonably close 
relationship between the SEER and the Spache/Dale-Chall ratings (rho = 0.881). 
The Rauding ratings appear to result in a reciprocal ordering (rho = -0.762). 

The answer to the last question depends somewhat on which criterion variable 
one is willing to accept. If the Spache/Dale-Chall ratings are chosen, the SEER 
technique is in most agreement. However, the dangers in accepting Spache/ 
Dale-Chall ratings have been discussed elsewhere (Froese, 1975). 


Discussion and Conclusions 


Since this study examined only the ratings of eight passages it cannot be 
considered representative of rating all types of materials. However it may be said 
that the SEER and the Rauding technique do not produce similar ratings on these 
passages. It should be noted, however, that the range of passages is small (1.8 to 6.4) 
and hence the ability of either technique to rank a wide range of materials is not 
known. 
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There appears to be little difference in the ratings of beginning or experienced 
teachers, and neither technique seems to favour a particular experience level group 
in a statistically significant manner. However, more guided practice in making 
readability estimates might well result in better judgements. 


If the Spache and Dale-Chall formulas are accepted as the criterion variable, 
then the SEER ratings are highly correlated (rho = 0.881), and appear to be a 
reasonable alternative if several raters are employed. The ratings of individual 
passages vary considerably (a visual inspection indicates that a standard deviation 
of approximately 1.5 is rather typical) and hence it remains to be shown that 
reasonable consistency may be maintained for a single rating of a particular passage. 
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Word Associated Arousal as an Encoding 
Dimension in Short Term Memory 


This study employed the Wickens, Born and Allen (1963) release from proactive 
interference experimental paradigm. A significant amount of release from 
proactive interference was obtained by shifting between differentially arousing 
categories of words. This finding suggested that rated word arousal is an 
encoding dimension in short term memory. The stimulus words were then rated 
by a semantic differential type procedure of the Osgood, Suci, and Tannenbaum 
(1957) variety in order to locate the word arousal dimension within semantic 
space as it is defined by the Activity, Potency, and Evaluation factors. 


The notion that motivation can affect retention has been extensively studied 
(Berlyne, 1967; Rapaport, 1942 ; Weiner, 1966). Unfortunately, the multiplicity of 
motivational theories (cf. Berlyne, 1967; Cofer & Appley, 1964), the fragmented nature 
of much of the research, and the equivocal nature of many of the studies (cf. Berlyne, 
1967; Weiner, 1966) appears to preclude a clear statement of the specific relationship 
between motivation and retention. In a comprehensive review, Weiner (1966) 
concluded that in spite of these shortcomings and other more specific methodological 
weaknesses, there exists substantial evidence that memory is in fact influenced by 
motivation. 


Arousal, traditionally regarded as a motivational variable, has been found to 
influence memory, although there is not general agreement on how it does so (Berlyne, 
1967; Weiner, 1966). The preponderance of the evidence suggests that higher levels of 


Allan Hayduk is Acting Director of Course Development at Athabasca University. His current interests include learning, adult 
education and distance education. 
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138 


Word Associated Arousal 


arousal facilitate long term retention (Berlyne, 1967; Osborne, 1971). That arousal 
affects short term recall has been well established, but the directionality of the effect 
remains unclear. Osborne (1971), in a review of this question, concluded that the weight 
of the evidence indicates that arousal facilitates short term retention but that 
considerable evidence to the contrary also exists. In addition, some studies have shown 
arousal to have no effect upon short term retention (Berlyne, Borsa, Craw, Gelman & 
Mandell, 1965). The unclear nature of the relationship between arousal and memory 
appears partly to reflect the confused state of the relationship between motivation and 
memory in general. 


Further, there appears to be uncertainty as to whether the effects of arousal and 
other motivational factors reflect the degree of original learning, the degree of 
retention, or the degree of subsequent recall (Weiner, 1966; Underwood, 1964). Indeed, 
many studies have failed to make these distinctions, in that motivational components 
were manipulated at the time of the stimulus presentation followed by measurement of 
the degree of stimulus recall (Weiner, 1966). Other studies have manipulated the 
arousal variable only during retrieval (Weiner, 1966). Without the experimental 
separation of learning and retention, the two variables become confounded. Asa result, 
no definitive conclusions can be drawn about the relationships between arousal and 
learning and between arousal and recall (Weiner, 1966). It should be noted that some 
researchers, such as Underwood (1964), have maintained that it is impossible to 
determine whether motivation affects retention directly or indirectly through its 
influence on learning. 

The identification of a specific relationship between arousal and learning, or 
between arousal and recall, wouid help to clarify a number of the uncertainties 
discussed above. For example, if arousal were found to function strongly as a learning 
or encoding variable, Berlyne’s (1967) suggestion that arousal can function by 
triggering a specific class of learned or innate behaviour would be supported. Such a 
finding would neither counter nor weaken the concept of arousal as a nonspecific and 
indiscriminate motivational variable, in that if arousal is a bonafide psychophysical 
variable, then high arousal words could have a nonspecific, activity increasing, 
performance strengthening function as well as a specific learning or encoding function. 

The identification of arousal as a learning variable would suggest a re-evaluation of 
many of the studies which have failed to make the distinction between learning and 
recall components. In many of these cases more post hoc emphasis could be placed 
upon learning and less upon recall mechanisms. Such a re-evaluation should help to 
clarify further the relationship between arousal and memory. 

The primary purpose of this study was to investigate word arousal as an encoding 
dimension in STM by using a release from Proactive Interference (PI) experimental 
design of the Wickens, Born and Allen (1963) variety. 


Experiment One 


The release from proactive interference (PI) experimental paradigm (Wickens, 
1970, 1972) has been used widely as an empirical means of identifying the dimensions 
along which items are encoded in short term memory (STM). Experimentally, a 
progressive decrement in recall of items in sequentially presented word triads drawn 
from the same semantic category (PI) is followed by an improvement in performance 
upon the presentation of a word triad drawn from a different semantic category 
(release from PI). This result suggests that the two semantic categories are encoded 
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separately in STM. Because this experimental paradigm is widely regarded as an input 
phenomenon (cf. Wickens, 1970), it was used in this study to identify arousal as an 
input and, hence, learning variable. 


Appropriate to the release from PI experimental design, categories of differentially 
arousing words were used as arousal determinants. Although words that have been 
rated on an a priori basis as differentially arousing have been used successfully as 
arousal determinants in a number of other arousal and memory studies (Osborne, 1972, 
1973; Maltzman, Kantor & Langdon, 1966; Kleinsmith & Kaplan, 1963, 1964; Walker 
& Tarte, 1963), it should be noted that the use of rated words as arousal determinants 
essentially constitutes an operational definition of arousal and represents a departure 
from the physiological perspective. 


Method 


Subjects. The subjects were 96 male and female University of Alberta 
undergraduate students from Educational Psychology courses. All students who 
participated received course credit as well as payment. Two subjects were replaced, one 
for failure to follow instructions, and one for previously undisclosed memory 
improvement training. None of the subjects had previously taken part in any 
psychological experiments. 


Materials. Osborne (1972) chose 40 nouns from the 925 nouns listed by Pavio, 
Yuille, and Madigan (1968) on the basis that they be equated for mean imagery and 
frequency, but differ maximally on arousal. These were subsequently rated for arousal 
on a seven-point scale, using a procedure similar to that of Pavio et al. (1968), by 218 
volunteers from an Educational Psychology course at the University of Alberta. From 
these 40 nouns, the experimenter chose 12 low arousal (harp, cigar, ankle, hurdle, fiord, 
banner, caravan, appliance, leaflet, hamlet, toast, juggler) and 12 high arousal 
(policemen, volcano, agony, alcohol, brute, skull, avalanche, morgue, ghost, disaster, 
snake, jail) nouns for the present study. The choices were made on the basis of maximal 
discrimination of low and high rated arousal levels and minimum potential for 
mnemonic association. They were then grouped into triads which the experimenter 
again chose so as to minimize mnemonic association (e.g., not policemen, handcuffs , 


jail but policeman, volcano, ghost). Because similarity of word sounds may affect 
interference (Conrad, 1964; Wickelgren, 1965), no similar sounding words were 
included in any one triad. 


Procedure. All slide sequences were projected with a carousel AV 900 projector, 
which was controlled by a Sony 630 reel-to-reel tape deck timer. All instructions were 
videotaped and presented to the subjects on a remote controlled videotape monitor. 


Each subject entered the experimental room and was seated. Personal data were 
obtained, and then the subject viewed a standard set of videotaped instructions. The 
mechanics of the experimental situation, as well as the performance required of the 
subject, were explained and the videotape was stopped. If the subject had any 
questions, they were answered and the experiment began. 


In general, the format of the procedure was as similar as possible to the Wickens 
(1972) procedure. That is, all materials were projected onascreen about 1.5m in front 
of the subject. An asterisk attention signal appeared for 2 seconds. It was replaced by 
the first word triad which appeared for about 1.5 seconds. The subject repeated the 
words once in order to expose perceptual or processing errors. A three-digit number 
slide appeared, and the subject began immediately to count backwards, by threes, from 
the displayed number as rapidly as possible for 15 seconds. The number was replaced 
by a question mark and the subject had 10 seconds to recall as many of the three words 
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as he could remember before the question mark disappeared. After a rest interval of 6 
seconds, an asterisk appeared for 2 seconds as a ready signal for the second triad. The 
experiment proceeded thus for three triads during which the subjects saw words from 
only one arousal category (high or low). 


On the fourth trial, the experimental group was presented with words from the 
other category (i.e., the previously low arousal group now saw a high arousal triad and 
vice versa). An equal number of subjects changed from low arousal words to high 
arousal words as changed from high arousal words to low arousal words. The control 
groups saw all four word triads only from within the same category (i.e. , either all low 
arousal words or all high arousal words). In order to identify subjects who attempted to 
facilitate word rehearsal by lessening their concentration on the distractor task, both 
the rate of counting and the number of errors in counting were recorded during each 
distractor task. 


Design. The stimulus words were distributed in all groups by using a 
counterbalancing rotation system so that each word occurred in each position of each 
triad an equal number of times throughout the experiment. Each triad occurred in each 
possible trial position an equal number of times. Because of this rotation system, each 
word occurred in each possible trial position two times during the experiment. In no 
case for the second presentation of any given word was the word above or below it in the 
same position as in the first presentation. Exactly the same procedure was adopted 
with the high arousal words as with the low arousal words. The low to high 
experimental (shift) group was made by removing the fourth trial row from the low 
control group and replacing it with the fourth trial row from the high control group. 
The high to low experimental (shift) group was made up in a similar fashion. 

There were 48 subjects in each experimental and control group. That is, the overall 
“experimental” group was composed of 24 people who viewed a high to low arousal shift 
and 24 people who viewed a low to high arousal shift. Similarly the overall control group 
was composed of 24 people who viewed high arousal words only and 24 people who 
viewed low arousal words only. All subjects were randomly assigned to experimental 
positions. The distractor task subtraction numbers were counterbalanced so that the 
same numbers did not appear in the same ordered position throughout the experiment. 


Results and Discussion 

Responses were scored as correct when they were given within the recall interval of 
the trial on which they occurred and without regard to the original order of the words. 
In the manner suggested by Wickens (1970), both the non-shift groups were combined 
to form a single control group and both the shift groups were combined to form a single 
experimental group. This combination of groups was possible because the shift from 
high arousal to low arousal produced the same amount of release as the shift from low 
arousal to high. Similarly, no difference in performance was found between the two 
control groups. 

Figure 1 presents the percentage of correct recall at each trial for these combined 
experimental and control groups. 

A t test for independent groups with equal numbers indicated a significant 
difference, t (87) = 6.62, p<.001, in performance on Trial 4 between the experimental 
and the control groups. A t test for dependent groups indicated a significant difference, 
t (47)=6.08, p<.001, between performance on Trial 3 and on Trial 4 for the 
experimental group. 

These results may also be reported by using the standard Wickens’ (1970) method 
of quantifying release from PI, in which case the release from PI obtained in this study 
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Figure 1. Percent recall from short term memory by the combined control group and shift 
group. 


The results show the usual PI effects found on similar positive experiments (Loess, 
1967;Wickens, 1970; Wickens, Born & Allen, 1963). A progressive decrement in 
performance occurred over the first three trials for both groups and was followed, on 
the fourth trial, by a marked increase in performance by the experimental group and a 
further decrement in performance by the control group. The rapidity and the amount 
of the preshift PI buildup, together with the shift contingent change in performance, 
indicate that the high and low arousal categories represent specific homogeneous 
classes of material with respect to at least some portion of their underlying meaning or 
the psychological processes representing that portion. The fact, as well as the 
magnitude of the effect, suggest that word associated arousal, an operationally 
derived intervening variable, functions as a semantic encoding dimension in STM and 
hence as a learning variable in studies of the relationship between word determined 
arousal and memory. 
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TABLE 1 


MEANS AND DIFFERENCES OF THE EVALUATION, ACTIVITY 
AND POTENCY COMPONENTS OF THE AROUSAL CATEGORIES 


Component High Arousal Low Arousal Difference 
Evaluation Says} 4.74 Sel 
Activity Say? 4.62 he. 
Potency Synily 4.08 1.09 


Experiment Two 


It may be recalled that the release from PI experimental design is predicated upon 
the assumption that word meaning resides within a collection of many encoding 
dimensions in STM. One implication of this multidimensional conceptualization of 
memory is that most, if not all, words could be considered to have a number of different 
component values along more than one of these different dimensions (Wickens, 1970, 
1972). That is, the meaning of a specific given word would be found in the quantitative 
degree to which it was encoded along a multiplicity of different dimensions. Osborne 

‘and Frankiewicz (1972), in a factor analytic study of high and low arousal words, 
considered the question of whether rated arousal is a distinct attribute of words or 
whether it is subsumed by other word attributes such as the Evaluation, Potency, and 
Activity factors reported by Osgood et al. (1957), and concluded that arousal is 
negatively evaluated and partly subsumed by the Activity factor. 


Subsequent to the completion of the release from PI experiment, the high and low 
arousal stimulus words were used as the concepts to be rated in a semantic differential 
type procedure. The administrationof the semantic differential followed exactly the 
procedures outlined by Osgood et al. (1957) in terms of scale construction, form of the 
differential (Form II design), and instructions to the subjects. The subjects were 25 
student nurse volunteers from the University of Alberta Hospital. 


Table 1 presents the obtained means for, and differences between, the high and low 
arousal categories for each of the semantic components. Inspection of Table 1 shows 
that the high arousal category was negatively evaluated, strongly active, and potent. 
The low arousal category was positively evaluated, active, and neutral with regard to 
potency. These data are consistent with the Osborne and Frankiewicz (1972) results. 


In general, Wickens (1970) and others have found that a release from PI of 60% or 
greater usually is obtained only along semantic dimensions (which includes changing 
the language of the symbol) and when the requirements of maximal polarity and 
homogeneity exist for the particular semantic dimension employed. In the present 
study, words were not chosen to be homogeneous or to maximize polarity along the 
activity, potency, or evaluation dimensions. This is reflected in the range of the rating 
means along each dimension as shown in Table 1. It is apparent that the range is low 
and clearly nonpolar in all cases. Thus, it seems unlikely that any of the above 
dimensions accounted for the release from PI obtained in this study. The most 
probable explanation of the release from PI obtained in the present study is that word 
arousal is a significant encoding dimension which exists within semantic space. 
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GRACE M. H. GAYLE 


University of Ottawa 


Another Look at Personality Motivation 
and Second Language Learning 
in a Bilingual Context 


Research and theory elaborated by Gardner specify the importance of motivation 
to second language acquisition and interpret the role of specific attitudes and 
orientations, including the integrative and instrumental orientations, to be 
supports of motivational intensity. In order to analyse some of the more intricate 
processes in second language acquisition, the objective of the present study was 
to examine relationships, deduced from theories, between the two orientations 
referred to and the personality measures known as psychological differentiation 
(field dependence/independence) and dogmatism. Data were observed from a 
sample of 414 Ontario students learning French. Analyses of variance supported 
three of four hypotheses. It was found that significant differences (p< .01 and 
p<.00) exist on the integrative and instrumental orientations in favour of 
extreme field dependent and extreme dogmatic students, respectively. However, 
there were no significant differences on the integrative and instrumental 
orientations for dogmatism and psychological differentiation, respectively. 
Differential treatment, during second language teaching, of the personality types 
discussed is therefore suggested. 


Gardner’ has stated that motivation is a major predictor of achievement in French 
as a second language and that it is the single best predictor of both “Grades in French” 
and “Behavioural Intention to Continue Studying French.” Other reports (e.g., 
Gardner & Lambert, 1972; Gardner & Smythe, 1975) quote evidence of the importance 
of motivation to second language acquisition. Gardner (1978) states further that 
specific orientations and attitudes act as motivational supports during the language 
acquisition process. These findings have prompted examination by the present author 
of some of the subterranean dynamics of motivation, for example, examination of 
personality variables which may significantly predispose the learner toward 
motivational orientations which have proven relevant to second language learning. The 
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intention in this study, therefore, was to attempt clarification of the motivational 
orientations of field dependent/independent persons (Witkin, Dyk, Faterson, 
Goodenough & Karp, 1972) in the second language acquisition context as well as to 
carry out empirical review of dogmatism (Rokeach, 1960) as a possible source of 
specific motivational orientations in second language learning. The study and the 
implications of results found are currently confined to the acquisition of French in 
Canadian bilingual settings where French residents are in the minority. 


The two motivational variables which have received outstanding analytical 
treatment by Gardner and his associates are the integrative orientation and the 
instrumental orientation towards second language learning. The integrative 
orientation may be summarized as the desire to identify with the members of the 
second language group and its culture and possibly to become a member of the group. It 
may be regarded as a total psychological outreach towards the second language group. 
On the other hand, the instrumental orientation implies a utilitarian approach to the 
learning of the second language as a means of improving one’s status or embracing 
career opportunities. 


These orientations are reported to have correlated positively with aspects of second 
language achievement, depending on the language of focus and the sociocultural 
context, and/or to have loaded regularly on the attitudinal/motivational factor found 
to be related to achievement (e.g., Gardner, 1975; Gardner, 1978; Gardner & Santos, 
1970; Gardner & Smythe, 1975; Lambert & Gardner, 1959; Lambert, Gardner, Barik & 
Tunstall, 1963; Lukmani, 1972; Spolsky, 1969). Such loadings have occurred almost 
consistently for English-speaking students, in Canadian bilingual cities, learning 
French (e.g., Gardner, 1960; Lambert, Gardner, Barik & Tunstall, 1963). Substantial 
mean correlations are quoted (Gardner, 1975) between the two orientations and 
“Behavioural Intention to Continue Studying French” for Anglophones across Canada 
learning French. Further, both orientations have been discussed by Gardner (1978) 
with respect to their precise role. They are now viewed as supports, to varying degrees, 
of motivation itself. Claims have been advanced concerning the interdependence of the 
integrative and instrumental orientations (e.g. Smythe, Stennett & Feenstra, 1972). 
However, the author considers these orientations to be sufficiently distinct both in 
nature and in import (Gardner, 1978) to merit separate analysis. 


A third variable which has shown repeated evidence of a strong relationship with 
second language achievement is the integrative motive, an attitudinal complex which 
includes measures of the integrative orientation, positive affects such as positive 
attitudes towards the learning situation or the second language teacher and 
motivational intensity itself (Gardner & Smythe, 1975, pp. 7-12). As may be expected, 
the integrative approach to language acquisition is presented as the essential feature of 
this complex. 


Recent enquiry (Chihara & Oller, 1978; Oller & Perkins, 1978) into the suitability of 
the Gardner-type questions for eliciting attitudinal/motivational information has been 
made. However, Chihara and Oller (1978) have reported that there does not appear to 
be any clear difference between the use of direct (Gardner-type) and indirect 
(Spolsky-type) questions in gathering attitude data. Oller and Perkins (1978) suggest 
sources of nonrandom, extraneous variance inherent to self-reported attitude 
data—namely, self-flattery, response set, and approval motive. No data supporting 
this claim have been brought forward. After careful examination of propositions made 
by the above authors, Gardner’ concludes that they are unsupported and unjustified. 
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The present author is of the opinion that the extent of the evidence offered by Gardner 
et al. relevant to a relationship between attitudes/motivation and achievement in 
second language learning is impressive and merits further consideration in spite of 
incidental failure to find significant positive relationships. 


The selection of degree of psychological differentiation (field 
dependence/independence) as an independent variable was based on recent 
specifications regarding the pervasive nature of this characteristic within several areas 
of psychological function (Witkin et al., 1972); for example, perceptual field 
independence is said to imply, within the social function, a high degree of 
independence and self-sufficiency. The concept of self-sufficiency or “self-non self 
segregation” now refers to social independence in situations which present ambiguity 
or information gaps rather than to social independence with respect to absence of need 
for emotional support, approval or attention (Witkin & Goodenough, 1976, p. 27). This 
“information-seeking” aspect of presence or absence of reliance on others is regarded by 
the present author as having possible relevance to second language acquisition. 


Given the basic assumption in the literature that the field dependent person has a 
general predisposition to rely on the “field” of focus, it is anticipated that the extremely 
field dependent student will appropriate the integrative orientation as a task-specific 
expression of a basic personality need, that is, he is already disposed to assuming 
behaviours and characteristics of others. Further, the ambiguity inherent to the second 
language learning task in a bilingual situation may mean that the integrative 
orientation will assume paramount importance for this type of student. In such a 
context, where two cultural groups live in close contact, the field dependent student 
will be constantly faced with social “similarities” which retain subtle, indistinct 
differences if he is not sufficiently familiar with the second language group. He will 
therefore seek information through integration. Simultaneously, the urgency of the 
integrative orientation for the field dependent student may be expected to forestall the 
expression of a strong utilitarian “instrumental” approach. This suggestion is 
congruent with the explanation of instrumentality as that orientation which 
“deemphasized the interest in becoming closer to the other group and focused instead 
on what a knowledge of the language could do for the individual without involving him 
directly with the group” (Gardner, 1975, p. 25). Providing that the instrumental, 
utilitarian approach is prevalent in the society, however, there is some reason to expect 
a greater positive relationship between the instrumental orientation and field 
dependence than between the instrumental orientation and field independence. 
Significantly higher scores may also be expected for field dependent students on the 
integrative motive as a more complete and intensive expression of the integrative 
orientation. 

The inclusion of dogmatism as the second independent variable was due in large 
part to previous failure to find a relationship between this variable and specific aspects 
of second language learning (Gayle, 1976; Koscielecki, 1979). Gardner and Lambert 
(1972) had previously hypothesized a negative relationship between authoritarianism 
and positive attitudes towards second language learning. On the basis of Rokeach’s 
(1960) theory of dogmatism as a more general interpretation of authoritarianism, 
however, the present research paper assumes a relatively versatile applicability of 
dogmatism to motivational factors in second language learning. The 
attitudinal/motivational disposition of the socio-political “authority” in the in-group 
or the potency of a particular orientation in the in-group is seen as the determinant of 
the kind of relationship between authoritarianism and specific orientations, such as 
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the instrumental orientation, to second language learning. That is, once the authority 
regards second language learning as a utility, the closed-minded dogmatic person 
accepts this idea even if a negative attitude is maintained towards the members of the 
second language out-group themselves. This assertion is in keeping with Rokeach’s 
interpretation of the “intermediate” region of the “central-peripheral dimension”. On 
the other hand, the integrative orientation is thought to be relatively independent of 
the dogmatism factor. To the extent that integration is possible or necessary, the 
second language group will tend to be seen by the dogmatic person himself as one of a 
large number of out-groups which deserve his contempt. His feeling of antipathy may 
even be strong enough to neutralize possible encouragement given by the authority for 
integration and group participation. 


Within the Ontario context from which the sample was taken, the attitudes of the 
authority have been assessed by the author as composite. The Federal Government has 
in recent years actively supported the integrative view within the Canadian population. 
The French minority group has been protected by the Official Languages Act of 1973. 
Residents have been encouraged to become bilingual, to familiarise themselves with 
the cultural values of the French and to preserve political unity. However, these 
overtones may be regarded as embellishments of a basically instrumental, utilitarian 
theme. The policy of political expediency adopted by the government has even been 
reflected, to some extent, in the instrumental bias of the general populace. Among 
members of the Civil Service, many have attempted to learn French as a second 
language in order to retain their jobs or to secure promotion. This situation has been 
extended to many other bilingual services. 


With regard to the above assessment of the Ontario authority and to the 
psychological structures outlined in theory, the following hypotheses seemed valid 
within the present context—that (a) field dependent students would score 
significantly higher than field independent students on measures of the integrative 
orientation and the integrative motive, (b) field dependent students would also be 
significantly different from field independent students on the measure of instrumental 
orientation, (c) no significant difference would be found for high dogmatic/low 
dogmatic students onthe integrative orientation and on the integrative motive, but (d) 
high dogmatic students would be significantly more instrumentally oriented. 


Method 


Sample 

The sample, representing university undergraduates in bilingual areas of Ontario 
who were learning French as a second language, was comprised of 442 students. Only 
those students who described themselves as non-French and as second or 
third-generation Canadians resident in Ontario for at least 3 years (a total of 414) were 
retained. A non-French student was defined as having no significant competence in 
French as asecond language and as living in a residence where French was not spoken. 


Instruments 

Three instruments were utilized in the collection of data—Thurstone’s Closure 
Flexibility Test as a measure of field dependence/independence, Rokeach’s 
Dogmatism Scale as a measure of general authoritarianism, and an attitude 
questionnaire reflecting a random presentation of items used by Gardner and Smythe 
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(1975) in the scales relevant to the integrative orientation, the instrumental 
orientation, motivational intensity, desire to learn French, attitudes towards learning 
the language, and attitudes towards the second language group. 


High reliability figures are reported for the Thurstone Closure Flexibility Test 
(Thurstone, a split-half reliability of .78 and Pemberton, a corrected split-half 
reliability of .94). The test has been found to have a .77 correlation with Witkin’s 
Embedded Figures Test (Podell & Phillips, 1959) and has been used by Witkin to 
measure field dependence/independence. 


As an index of validity Rokeach reports a significant difference of 56.1 (p = .01) on 
the Dogmatism scale between subjects known to be very open and closed-minded. 


Corrected reliability measures for various British and American samples range from 
.68 to .93. 


Kuder Richardson reliability coefficients and item-total correlations reported by 
Gardner and Smythe (1975) are generally satisfactory. 


Procedure 


With respect to the attitudinal battery, average standard scores based on five of the 
six scales referred to (excluding the instrumental orientation) were used as an index of 
the integrative motive. All scales inclusive of the instrumental orientation had been 
randomly presented in the same questionnaire. 


Relationships expected were tested through a series of three 2 x 2 analyses of 
variance using the integrative orientation, the integrative motive, and the instrumental 
orientation as dependent variables. For the purpose of the analyses, selection was made 
of individuals having the 30% lowest and highest scores on psychological 
differentiation and dogmatism. Simple post hoc analyses of variance were conducted. 


Results 


The majority of the expected relationships outlined previously were confirmed by 
the results for main effects. 

As predicted, Table 1 indicates that field dependent students reflected higher 
levels of integrative orientation and integrative motive than field independent 
students. A significant main effect was found for psychological differentiation on the 
integrative orientation (F = 6.44, p<.01) and on the integrative motive, (F = 11.28, 
p<.00). Contrary to expectations, no significant difference was found for field 
dependence/independence on the instrumental orientation. As hypothesized, no 
significant difference was found for high dogmatic/low dogmatic students on either the 
integrative orientation or the integrative motive. However, high dogmatic students 
scored higher than low dogmatics on the instrumental orientation (see Table 1) 
resulting in a significant main effect, (F = 13.09, p<.00). 

There were no significant interaction effects on any of the three attitudinal 
measures. It may also be noted that, on the measure of the integrative motive alone, 
significant differences occurred between field dependent/independent students within 
subgroups for high and low dogmatics, p<.05 and p<.01, respectively. 


All distributions for the three attitudinal and two personality measures included in 
the study were relatively normal (see Table 2). The highest measure of skewness was 
-1.61 for the distribution for dogmatism. Analyses of variance are therefore considered 


to have been reasonably valid. 
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STATISTICS FOR SUBGROUPS IN THE SIMPLE 
POST HOC ANALYSES OF VARIANCE 


TABLE 1 


N Mean SiayD} 
Integrative Orientation 
Fis, Low iD. 39 38.69 8.42 
Mallos liori Mba 42 Boz 9.59 
WaDiag, Islstede De. 29 USL & SXe} 8.82 
a ilcn |bWedm. Die 36 Byewe ols) 
Integrative Motive 
EVD? , LLOWeD:. 39 794 4.03 
Fad 2 okow! Dy 42 -1.59% 3.68 
FiDs> .High Dy 29 1207" 4.59 
Fels, Hien D. 36 = 1.05 4.06 
Instrumental Orientation 
Ey. Den howaD 39 Sill 5 180) The227/ 
Belen Lown 42 3329 7.68 
WGDowg, Telelyea oly 10) 29 BWR) TS y? 
1g iho lebYeday 1D) 36 351592 6.26 
@ Standard scores. 
TABLE 2 
DESCRIPTIVE STATISTICS FOR DISTRIBUTIONS 
M ea Skewness 
Integrative Orientation 371.95 8.59 = oe 
Integrative Motive = 200% 4.06 - .50 
Instrumental Orientation S423 Wash) - .22 
Dogmatism 139027: 36.49 -1.61 
Ee De/B ales 73.96 25.89 - .13 


@ Standard scores. 
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Discussion 


Descriptive statistics quoted for each of the distributions and analyses of variance 
reported above support the view that the personality traits discussed in this study have 
some discriminative power for the second language acquisition process in the Canadian 
bilingual context when French, as the minority language, is the target language. 
Whereas field dependence appears to contribute to language acquisition by virtue of 
the integrative approach, high levels of dogmatism appear to increase the motivational 
drive for instrumental reasons. 


The cause-effect interpretation implied in the previous sentence is supported by 
the content of the three theories under discussion taken together. On the one hand, 
Gardner’s model and empirical examination of his model demonstrate that the 
integrative and instrumental orientations increase motivation which in turn leads to 
language acquisition. On the other hand, field dependence and dogmatism appear to be 
contributing predictors of the extent of the orientations referred to since both Witkin 
and Rokeach confirm the early emergence of the personality characteristics as a result 
of interaction with the childhood environment. Conceptually, it is difficult to see how 
the instrumental orientation to second language learning could lead to dogmatism or, 
for that matter, how the integrative orientation could precede the more general and 
early-appearing field dependent approach. 


In terms of the rationale set out in the introduction to this study, the present 
finding for field dependence relevant to the integrative orientation/motive (exclusive 
of the instrumental orientation) suggests support to the theoretical proposition 
(Witkin & Goodenough, 1976, p. 27) that the field dependent person’s penchant for 
relying on others as a source of information in ambiguous situations provides the 
overriding difference between himself and field independent persons during social 
interaction. Certainly, the importance of the ambiguity of the bilingual context to the 
second language learning process is an interesting supposition which merits closer 
examination in this and in other respects. At this point, however, it is safer to conclude 
that the ambiguity of the second language learning field is one of many possible reasons 
for the difference found between field dependent/independent persons on the 
integrative dimension. 

The failure of significant differences to emerge on the instrumental orientation 
between extremely field dependent and field independent students is surprising. This 
may be due to the greater prevalence of the integrative feeling within the general 
English population, in spite of the basic utilitarian attitude of the Federal 
Government, and the consequent attractiveness of the integrative point of view to the 
dependent person. 

As an additional evaluation of Witkin’s differentiation construct relevant to 
second language learning, it appears that field dependence per se may be more 
consequential than dogmatism to second language learning in those Canadian bilingual 
areas under discussion. This deduction is made since Gardner (1978) had previously 
affirmed the relative non-importance of the instrumental orientation to the second 
language learning process in Canadian bilingual and monolingual settings, although 
correlations as high as .34, p< .01, between the instrumental scale and motivational 
intensity were reported. 

It should be pointed out that statistical analyses adopted in this study involved the 
use of extreme scores on the personality measures. This selective approach, however, 
was considered rational and in keeping with theoretical distinctions. Strong 
correlations were not expected between entire distributions for the personality and 
motivation variables. In the case of field independent students, for example, only 
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extreme scorers were predicted to portray the kind of isolationism which could militate 
effectively against integrative behaviour (Witkin et al., 1972, pp. 174, 175). Similarly, 
the student scoring near the midpoint on the dogmatism scale could be expected to 
have mixed, ambivalent, or unpredictable reactions to the instrumental orientation, 
particularly in view of the interdependence between this variable and the integrative 
orientation in interpretation (Smythe, Stennett & Feenstra, 1972). At the extreme 
dogmatic end of the scale, however, where the individual tends to see the outgroup as 
“anathema,” the instrumental expression may quite easily become the only option and 
may be experienced and interpreted in aclear-cut manner. Results confirmed that 
correlations between field independence and the three dependent variables and 
between dogmatism and the dependent variables were in no case higher than -.13 and 
.09, respectively. However, the direction of the correlations was consistently supportive 
of the kinds of differences hypothesized previously for extreme scorers on the 
independent variables. 


In general, data indicate, therefore, that the roles of field dependence and 
dogmatism within the psychological dynamics of second language acquisition are 
neither negative nor negligible. Further studies should be carried out on a comparative 
basis in order to determine, for example, whether field dependence retains a significant 
effect on the integrative orientation in a non-ambiguous monolingual setting. In 
addition, the question of whether or not similar submotivational relationships would 
be discovered in different bilingual circumstances is of interest. For example, what 
might be the relationship between dogmatism and the instrumental orientation in 
cases where English, as the minority language, is being learned by members of the 
French majority group? Sociological and political issues which have developed over the 
years suggest that relevant findings in regions of Quebec, for example, may be quite 
different. 


The implications of the present findings for educational practice are obvious. They 
also defy disregard. It would appear that, in the Canadian bilingual setting where 
French is being taught to members of the English majority group, teachers must not 
only be aware of the relative effects of distinct motivations in second language learning 
but, in their attempt to reinforce attitudes and motivations, they must also be sensitive 
to the nature and personality of the student being taught. Shifts in emphasis or 
overemphasis of particular orientations may be either acceptable, unimportant, or 
threatening to the individual student. 


Second language learning theorists and researchers have intimated often in recent 
years that personality is of some consequence, direct or indirect, in the acquisition of 
the language. The present study has demonstrated certain subtleties in an indirect 
relationship between personality extremes and language learning as_ these 
relationships have been observed in a bilingual context. Given the complexity of 
human psychological function, knowledge of such intricate dynamics should be most 
useful and challenging to the second language teacher. 


Notes 
if Gardner, R. C. Psychological aspects of second language acquisition. Paper presented at the conference, “Perspectives 
of Language,” University of Louisville, Louisville, Kentucky, May, 1976. 
2 Gardner, R. C. Affective variables in second language acquisition: Back to basics. Research Bulletin No. 497, 


Department of Psychology, University of Western Ontario. Submitted to Language Learning. 
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The conception of this study occurred in response to public concern of whether or 
not children enrolled in grade 3 in 1977 acquired the same level of academic 
achievement as did children 21 years ago. In order to investigate this question, 
the results of a battery of achievement tests administered to students enrolled in 
grade 3 in the Edmonton Public School system in 1956 (N=3 570)' were compared 
to the results of the same test battery (corrected for item bias due to social 
change across time) administered to students enrolled in grade 3 in the 
Edmonton Public School system in 1977 (N=4400)'. The results from this study 
indicated that the overall level of grade 3 communication skills had increased 
from 1956 to 1977 while mathematical skills had decreased slightly. It was noted 
that the decline in mathematics scores was not general, but related to items in 
specific areas, such as the Roman numerals. It was pointed out, also, that the 
1977 grade 3 students averaged 1.72 months younger than their 1956 
counterparts. 


For several years there has been a growing criticism of student achievement in 
Alberta as well as in other parts of Canada. Newspapers and periodicals have 
frequently expressed complaints, usually undocumented, that basic skills are 
unsatisfactory and that they have deteriorated in recent years (Canadian Chamber of 
Commerce Education Committee, 1975; Saint John’s Edmonton Report, 1976; Gray, 
1975; Edmonton Journal, 1975). Research studies have reported mixed findings. 
Student achievement declines were reported by Muir (1961) and Alberta Education, 
Red Deer Regional Office (1976), while Hedges (1977) reported general improvement 
with the exception of arithmetic computation and arithmetic reasoning. The Nova 
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Scotia Department of Education (1976, 1977) reported higher performance in all 
academic skills with the exception of some areas of mathematics. 


A study was conducted for the purpose of comparing grade three achievement 
standards in Edmonton in 1977 with achievement standards obtained in 1956. This 
study was carried out through the Alberta Advisory Committee on Educational Studies 
and funded by Alberta Education. The project, referred to as the Edmonton Study, 
involved all grade three classes in the Edmonton Public School system in 1956 and 
1977. Comparisons for the Edmonton Study were made possible by the fact that an 
extensive battery of tests had been administered to grade three pupils in Edmonton 
public schools in 1956, and that the test booklets with the students’ answers had been 
retained. A re-administration of the test battery was then required in order to 
determine overall gains or losses in those levels of achievement measured by the test 
battery. Full details regarding this study are documented in two reports published by 
Alberta Education (Clarke, Nyberg & Worth, 1977,1978). 


Procedure 


The general procedure employed in the study was to administer, in 1977, the same 
battery of tests that had been given in 1956 to the grade three pupils enrolled in the 
Edmonton public school system. Testing conditions regarding such matters as time of 
year, testing instruments employed, and testing personnel were duplicated in 1977 as 
far as possible. In addition, uniformity of standards in scoring was ensured by having 
the 1956 tests re-scored by the same people charged with the task of scoring the 1977 
test booklets. 

The tests employed in the Edmonton Study were as follows: 

1. California Achievement Tests Complete Battery Primary Form CC, 1950 

Edition. 

2. California Short Form Test of Mental Maturity Primary S-Form, 1953 Edition. 

3. Gates Advanced Primary Reading Tests Form I, Type I Word Recognition 1942 
Edition. 

4. Gates Advanced Primary Reading Tests Form I, Type II Paragraph Reading, 
1942 Edition. 

5. Raven Progressive Matrices 1947 Sets A, Aj, B. 

All of the tests, with the exception of the Progressive Matrices, were out of print in 

1977, therefore special permission was obtained from the test publishers to reprint 

them at the University of Alberta. 

Testing was conducted, in both 1956 and 1977, by university students and by the 
grade three classroom teachers. The teachers administered the California Achievement 
Battery and the two Gates Reading tests, but the two ability tests were given by the 
university students. 

All the tests were administered to the students during mid-May to early June in 
both 1956 and 1977. The Progressive Matrices were administered to a sample of 
approximately three-fifths of the total group of pupils in 1977; that is, grade three 
pupils in three-fifths of the schools were given this test. This was necessary in order to 
permit four other samples to be tested in connection with a study of grade three 
achievement in Alberta which was being conducted simultaneously. 

Curricular validity was of particular concern in this study. The fact that the tests 
listed above were chosen for use in 1956 would indicate that they were considered to be 
valid, at least insofar as content was concerned. There was a fear, however, that the 
content would be less valid for the 1977 pupils. In order to ensure curricular validity in 
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1977, three groups of reviewers were asked to identify those test items thought to be 
inappropriate for the 1977 students. These reviewers were University of Alberta 
curriculum experts, Edmonton Public School Board supervisory staff with 
responsibilities in elementary education, and the teachers of all participating grade 
three classes. The reviewers identified seventeen test items, one with two parts, as 
being inappropriate for the 1977 students. Verification of these judgments was 
accomplished through item analysis procedures. That is, it would be expected that the 
difficulty levels and the discrimination indices (point biserial correlations) of the 
inappropriate items would be different in 1977 as compared to 1956. As the students 
took the complete test in every case, item elimination was carried out as part of the 
analysis procedure. Statistics for the complete tests were also obtained as a matter of 
interest. 


TABLE 1 


STATISTICS OF ITEMS DEEMED INAPPROPRIATE 


Item Item 
Ain at Sates ae Fen t Difficulty® Discrimination 
tS Ome 195 ORL OM, 
ied ee 24 Word veil 7 Lane gos 727 ee OO 
Recognition Sa 
42 Word arbor 43 42 5a5) - 30 
44 Word dormitory 0 5) SS) 33 
47 Word pugilist 28 26 KO 04 
Gates Paragraph 
Reading Shil Word temperature gals -20 ae 5a)5) 
Syl (2nd part) temperature yall 29 - 36 SS 
C.A.T. Arithmetic 
3 Sec C 8 Roman III ate’ se 37 33 
3 Sec C 9 Roman VI - 66 5 2) sake 5 aks 
3 See GC 10 Roman XX - 68 2 -40 -36 
3 Sec D 10 Meaning of 1b. - 80 82 2 Bi 343 
3 Sec E 3-Pith 2 Etenhow . 82 ones} 730 5 By 
3 Sec E 4—Pt.2 Write how cis) a0 o 3 a2 
3 Sec E 4-Pt.2 Write how . 86 . 84 woe 34 
4 Sec I 4 Add 4 digit nos. . 86 5 i A oo 
4 Sec I /i Subtract 4 digit nos. 3 52 Sie) 0 -40 
4 Sec I 8 WAwLltenioulyy YY ste Sh Glakealies HO BOS gdbih ads 
4 Sec I 9 16 + 4 528) 50 jae) S35) 
4 Sec I 10 SIO) <p 35 .09 .02 .20 ols) 
2 Proportion of pupils who responded correctly to an item. 
b 


Point biserial correlation between success on item and score on all other 
items. 
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Students Tested 


In 1956, the tests were administered primarily for norming purposes; hence, 
virtually all of the grade three Edmonton Public School pupils were tested. In keeping 
with the policy of duplicating the 1956 procedures in 1977, all grade three Edmonton 
Public School pupils were scheduled to be tested. In both 1956 and 1977 only those 
pupils present on the designated testing days were tested. 


Results 


Table 1 indicates the test questions that were deemed inappropriate for the 1977 
pupils, and provides the item analysis data for each of these test questions. 


There was no need to exclude any of the test items for the two Gates tests. For some 
items that were deemed inappropriate (i.e., Word Recognition items 24 and 25) the 
1977 performance was lower, as expected, but for the remaining excluded items (i.e., 
Word Recognition items 44 and 47, and Paragraph Reading items number 31, both 
parts) the 1977 performance was actually higher. In the California Arithmetic test, 
however, elimination of several test items was justified. In eight of the twelve items 
deemed inappropriate, the 1977 pupils achieved below the 1956 standard, as is 
evidenced by these test item difficulty levels. With regard to the remaining four test 
items, the 1977 pupils achieved above the standard set in 1956, but in all cases the 
differences were marginal. 


Table 2 indicates the results for the tests when the inappropriate test items were 
eliminated, while Table 3 indicates results for the complete tests. Table 4 indicates the 


results of the two ability tests. It should be noted that data for the Raven test were 
based on asample, as mentioned earlier. 


TABLE 2 


1956 and 1977 SCORES ON TESTS 
WITH INAPPROPRIATE ITEMS EXCLUDED 


Increase in 
1956 ie Sa SMT a eee 
Test SSS eee ' ee 3.D mean from 
N sete Meme’ a Lk ‘1956 to 1977 
Gates Word Recognition 3 540 35.35) eo” 4 430 B37 ma Ae Ol +2 .06 
Gates) Paragraph 3569 22.27 4.14 4 430 22.66 4.03 4439 
Reading 
aH IES) 
Total Gates Reading 57s OZ 60.07 
California Achievement 4, .7¢ 75,26 6.97 4 417° 77.58 5.93 +2. 32 
Reading 
California Achievement 3 377 55.98 5.97 4 367 55.56 7.58 =a ue 
Arithmetic 
California Achievement 3 564 94.24 5.35 4 391 25.41 5.17 Peay 
Language 
California Achievement 155.48 158.55 1307 
Total 
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TABLE 3 


1956 and 1977 SCORES ON COMPLETE TESTS 


Increase in 


Test Lee Pann TURE ORC) Pe waeweee Dt ee ees mean from 
N Mean  S.D. N Mean S5S.D. 1956 to 1977 
parce yore 3 S400 3) 26 erOr Lo) 49430 39.35 oF +2.10 
Recognition 
cree See con 3.569 22.63 4.37 4430 23.15 4.36 +0.52 
Reading 
Total Gates Reading 59.91 62:53 +2.62 
erpeinee ati cy marti ON Ys La aye eA VER remiss, ence: +2.32 
Reading 
Cati fornia) Bch Levesicn ly ian i7 ae eee Ore EST Olt MOS cin 
Arithmetic 
supe arininage asi ae ec iS VAST! oA, Se Re Se we AML HO Glarny NEN +1.17 
Language 
California Achievement 162.94 166° 71 P97 
Total 
TABLE 4 
1956 and 1977 SCORES ON ABILITY TESTS 
Increase in 
Test ee eee eee eee oe mean from 
N Mean S.D. N Mean  S.D. 1956 stow L977 
California Test of 
Mental Maciel ey S443 O09 Ole LG 4 S/O nC Ole 2mm Orie 2 4.03 
Raven Progressive 
Meceiese SBE) (BASSE SoS 2 BUY BioPS) S058 2.46 
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Discussion 


This study appears to provide solid evidence that the level of achievement in the 
communication skills for grade three pupils in the Edmonton Public School system had 
risen in the period from 1956 to 1977; this difference occurred despite the fact that the 
1977 pupils were, on the average, 1.72 months younger. In the case of mathematics, 
performance of the students, as defined by the 1956 edition of the California 
Achievement Test of arithmetic, had declined. The decline was not general, however, 
but appeared to be specific to a few test items. This indicates that the decline may be 
more a function of changes in teaching emphasis resulting from curriculum changes 
than from a deterioration in pupil performance. For instance, lower scores obtained 
from items involving Roman numerals were to be expected because most of the 1977 
pupils were taught the Roman symbols for numbers one to ten only. Similarly, in 
questions involving money, most of the 1977 grade three classes dealt with amounts up 
to one dollar only. Whether or not these curriculum changes are desirable was not a 
matter for study in this project. 


The criticism of standards of education appears not to be justified, at least with 
respect to the first three grades of schooling in the Edmonton Public System. 


Notes 


ip The Ns varied slightly among the tests within the test battery. 
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The characteristics of 567 doctoral dissertations reported in the area of 
secondary reading from 1960-1976 are described, including: (a) discussion of 
demographic variables such as yearly production of dissertations, type of degree, 
sex of author, institutions awarding degrees; and (b) content analysis of topics, 
research type, domain of investigation, data analysis techniques, and 
instrumentation used. Results reveal that for the period 1960-1976: more EdD 
than PhD degrees have been awarded, the number of female degree recipients is 
increasing with the degree tending to be the PhD, topical trends in dissertations 
vary by year and between the two degrees, EdD dissertations show more 
variability in length, and there is a trend toward more empirical research and 
greater use of sophisticated data analysis techniques in secondary dissertation 
research. It is concluded that the dissertation literature is a valuable source of 
information and research ideas in the area of secondary reading. 


The International Reading Association’s third edition of Graduate Program and 
Faculty in Reading (Seifert, 1978) lists information on over 300 colleges and 
universities offering graduate programs in Canada and the United States. There is no 
dearth of opportunity to get involved in graduate work in reading, including advanced 
doctoral study. Approximately one-third of the institutions list doctoral programs 
leading to the EdD or PhD degree. 


Dissertations are almost universally required for the doctoral degree in North 
America. Surveying doctoral offerings in education in 145 U.S. institutions, Robertson 
and Sistler (1971) found that more than 90 percent of both EdD and PhD programs 
used the dissertation as the only means of satisfying the terminal research 
requirement. Robertson’s 1971 study revealed similar requirements for Canada. 
Although some institutions are experimenting with substitute requirements, the 
dissertation shows no sign of being immediately abolished. The sheer bulk of 
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dissertation literature is impressive, with 340 000 doctoral degrees awarded between 
1861 and 1970 and a projected figure of 340 000 more for the 1971-1980 decade (Wolfe 
& Kidd, 1971, p. 784). University Microfilms (1974, 1975) reports that for 1960 to 1975, 
a total of 63 929 dissertations were completed in the field of education — almost 20 
percent of all dissertations produced. 

The purpose of the research reported here is to add information not found in the 
IRA publication and provide a more complete picture of graduate study in secondary 
reading in North America. 

The source used for dissertations was the abstract publication Dissertation 
Abstracts International (DAI) published by University Microfilms. Volumes 20 
through 36 (July 1961 - June 1976) were hand searched to provide a 15-year corpus of 
abstracts for analysis. Hand searching was found to be best. Studies can be hidden in 
index schemes because of vague titles or the manner in which the titles are classified. 
Some dissertations have not been included because some institutions, both major and 
minor, do not choose to participate with DAI. The total omitted would be less than two 
percent for the time period studied. 


Reading research was defined as being any dissertation dealing, in whole or part, 
with reception of communication through print. This broad definition was 
deliberately used to capture all dissertations with obvious reading content. Then those 
studies dealing with grades seven through twelve, or using at least in part an adolescent 
sample, were selected as being dissertations related to secondary reading. Using this 
definition produced a total of 567 secondary titles. Each DAI abstract was tabulated 
according to 10 demographic and content variables for computer manipulation 
(demographic variables: year of completion, degree type, sex of author, institution, 
length; content analysis variables: topic, type of research, educational domain, data 
analysis techniques, data gathering instruments). 


Descriptive Data 


In examining dissertations completed by year, it was found that a somewhat 
dramatic surge occurred in the period 1971-1976 (dissertations completed in 1960-1965 
= 15% of the total, 1966-1970 = 22%, 1971-1976 = 63%) perhaps reflecting 
expanding interest in secondary reading as an area of professional study. 
Approximately equal numbers of PhD’s and EdD’s were completed across the 15 years 
(54% and 46% respectively) with, overall, seven out of ten dissertations being granted 
to males. The 567 dissertations were granted from 115 separate institutions with a 
small number of colleges and universities producing better than 50 percent of the total. 
The 16 institutions which granted 10 or more secondary dissertations are listed in 
Table 1. Dissertations ranged in length from 45 to 685 pages with a median length of 
157 pages. The page length distribution was positively skewed with the standard 
deviation of 85.7 emphasizing marked variability in the length of dissertations. 

Using an empirical method of classification (Jeroski, 1977), the abstracts were 
organized into 30 content categories and subcategories. Other researchers have 
reported use of classification systems based on authoritative opinion but such systems 
usually result in considerable loss of information. For example, Palmatier and Manzo 
(1974) and Palmatier and Rood (1975) reported analysis of 273 and 371 dissertations in 
reading for 1973 and 1974. Their subjective classification scheme, consisting of 31 
categories, was used to organize the 1973 collection and resulted in 68 percent of the 
dissertations being classed as “other.” A broader typology was developed for 1974, 
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consisting of 43 categories, with 28 percent of titles being classed as other. With the 
empirically developed categories (in which the content of the documents determines 
the classification categories) used in this study, it was possible to classify precisely 
every dissertation and avoid use of the ubiquitous “other” category. 


TABLE 1 


FREQUENCY OF DISSERTATIONS BY INSTITUTION 


Institution Degrees Awarded 
New York University 26 
Florida State University 2S 
Boston University School Education 20 
Indiana University 20 
University of Southern California 17 
Syracuse University 17 
University of Pittsburgh 16 
Temple University 13 
University of Oregon 13 
Columbia University Teachers’ College 12 
University of Georgia iW 
Arizona State University Ted 
University of Northern Colorado 10 
University of Minnesota 10 
University of Illinois 10 
University of Wisconsin 10 
No. of Reported No. 
Institutions of Dissertations 

4 9 

3} 8 

7 7 

7 6 

8 5 

5 4 

16 3} 

ied 2 

32 iL 


The 30 categories were collapsed into the 12 broad topics appearing in Table 2 to aid 
in interpretation of overall trends. The four categories, Reading Skills, Correlates of 
Reading Achievement, Reading in Content Areas, and Reading Problems accounted 
for six out of ten dissertations across the 15 years. The most striking characteristic of 
the original 30 categories and the 12 collapsed categories in Table 2 is the wide array of 
topics in secondary reading which were of interest to the 567 doctoral candidates in 
their research. 
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TABLE 2 


FREQUENCY OF DISSERTATIONS 
BY COLLAPSED CONTENT CATEGORIES 


Content category N % 
Correlates of Reading Achievement q3 13623 
Measurement, Evaluation and Diagnosis D2. 3.88 
Motivation and Attitude Toward Reading 2, 4.76 
Response to Reading hl 3) 
Reading Programs 41 Hise 
Materials 35 6.17 
Reading Personnel 40 7.05 
Reading Skills 129 DOTS 
Reading in Content Areas 71 WS BY 
Reading Problems 67 GZ 
Atypical Learners 19 Sha SI) 
Miscellaneous Research 10 10a HAS 


With respect to general focus, more than half the dissertations dealt with the 
cognitive domain, one-quarter with a mixture of the cognitive and affective, and less 
than three percent had the affective domain as the primary concern.The most frequent 
type of research was quasi-experimental with correlations, experimental, and survey 
research the next most frequently occuring categories (Table 3). It is interesting to 
note that almost half the dissertations were classified as either experimental or 
quasi-experimental suggesting that some type of experimental manipulation of data 
took place in the study. 


TABLE 3 


FREQUENCY OF DISSERTATION BY RESEARCH TYPE 


Research Type N vs 
Quasi-experimental 1723 Shoe sy7/ 
Correlational 131i 23.14 
Experimental 85 15.02 
Survey 68 12501 
Content analysis 49 8.66 
Material development ZA. Baia: 
Philosophical 18 Biko 
Ex post facto 14 2.47 
Other 7 1.24 


On i SS SS eee 
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Table 4 lists the data analysis techniques used. Simple correlation and descriptive 
analysis accounted for almost 50 percent of the research with analysis of variance 
techniques adding another 20 percent. More sophisticated techniques such as factor 
analysis, multivariate analysis of variance, and multiple discriminant function 
analyses were less often cited. The most frequently cited data gathering technique (24 
techniques in all were identified) was the standardized achievement test with other 
frequently used techniques being nonstandardized achievement tests, standardized 
aptitude tests, demographic classifications, and nonstandardized survey instruments. 
More than one data gathering technique could be reported in a study and a total of 
1 370 techniques were cited for the 567 dissertations. 


TABLE 4 


FREQUENCY OF DISSERTATION 
BY DATA ANALYSIS TECHNIQUE 


Data Analysis Technique N 
Simple correlation is Po Syl 
Descriptive statistics 130 PBS \ 5 2X3} 
Analysis of variance WAT 22.40 
t test 66 11.64 
Analysis of covariance 59 10.41 
Multiple regression iby) 6.70 
Chi square 29 Sy qJlal 
z statistic 7) 4.76 
Not applicable 26 4.59 
Multivariate anova iy, 3.00 
Factor analysis 10 ye /,6 
Nonparametrics 7 123, 
Discriminant function 4 6 (Al 
Not given 139 Dies. Sil 

Bivariate Analyses 


Analysis of the nominal data, organized into bivariate tables using Pearson’s Chi 
square, produced the interesting results summarized in Table 5. 


Year of Completion. Examination of year by type of degree indicated a significant 
relationship (x? = 15.18, df = 2) which is accounted for by the fact that there was a shift 
to more PhD’s granted since 1970, reversing the 1960’s trend where the EdD was more 
prevalent. For the total 15 years, three out of four doctorates were awarded to males. 
However, a highly significant relationship was found for sex and year of completion 
{x’= 15.18, df=2) which indicates a sharp increase in female doctorates since 1970. 
Analysis of variance revealed mean page length was not significantly related to year of 
completion, which suggests that, although slight differences in mean length occurred 
each year, the wide variability in page length noted earlier was typical across the 
15-year time span. The topics of dissertations have changed over the years as indicated 
by the highly significant relationship (x?=49.56, df=20) between year of completion 
and content category. Most notably, the content categories Response to Reading and 
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Reading Personnel represent a chronological increase while dissertations on Reading 
Problems and Reading Programs have decreased. Because of the low cell frequency, it 
was not possible to evaluate the data for research type and data analysis techniques 
statistically in relation to year of completion. Overall, experimental studies have shown 
an increase while quasi-experimental, correlational, and survey studies have remained 
proportionally the same. Analysis of variance (including covariance and t tests) and 
multivariate data analysis techniques have increased proportionally over the years. 


TABLE 5 


SUMMARY OF BIVARIATE ANALYSES 


Variable Year Degree Sex 
Year -- * KKK 
Degree x -- Te Sis 
Sex KKK Mie == 
No. of pagest Tie Ste n.s. n.s. 
Content classification AKK aK n.s. 
Research type invalid Noe: 1. Sis 
Data analysis technique invalid n.s. n.sS. 


* 
kk 


chi square significant at .05 
‘chi square significant at .01 


kkk = chi square significant at .001 
n.s. = not significant 
invalid = cell size insufficient for chi square analysis 
+ = relationships with mean number of pages 


analyzed by using / statistic 


Type of degree. As reported above, year of completion was significantly related to 
type of degree. Type of degree was not significantly related to sex of author (x?=2.57 
n.s., df=2) although the data reveal a higher proportion of PhD’s being awarded to 
females. Analysis of variance indicated that page length was not significantly related to 
type of degree, mean page length being slightly higher for thePhD. However, a notable 
difference in standard deviation for page length by degree was found using Bartlett’s 
test for homogeneity of variance; much greater variability was evident in the length of 
the EdD. The topics pursued in dissertations have varied depending on whether the 
degree was the EdD or the PhD as indicated by the significant relationship between 
type of degree and content category (x’= 26.16, df=2). A disproportionate number of 
PhD’s appeared in the categories Response to Reading and Reading in Content Fields 
while a disproportionate number of EdD’s appeared in the categories Motivation and 
Attitude Toward Reading, Reading Programs, and Measurement, Evaluation and 
Diagnosis. Type of research and data analysis technique were not significantly related 
to degree type (x? = 8.36 ns., df=8; 4.87 ns., df = 6, respectively). This is an 
interesting finding and supports the observations of Ashton (1965), Spurr (1970), 
Boyer (1972) and others who have suggested that any theoretical differences between 
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Sex of author. A disproportionate number of females completed degrees in the 
1970’s, as revealed by the highly significant relationship between sex of author and year 
of completion (x? = 15.18, df = 2). Sex of author, however, was not significantly related 
to type of degree awarded (x?=2.57 n.s., df=2), length of degree or page variability, 
topic of the dissertation (y?= 10.79 n.s., df=10), type of research (x?=6.89 n.s., df=8), 
or data gathering technique used (x? =6.06 n.s., df=6). 


In summary, the bivariate analyses revealed the following significant relationships: 
(1) more PhD’s than EdD’s have been awarded since 1970, (2) more females have been 
completing doctorates in secondary reading in recent years and the degree tends to be 
the PhD, (3) the topical emphases in dissertations changes year by year and EdD’s and 
PhD’s vary in the topics researched, and (4) although the average length of the EdD 
and the PhD are similar, the EdD shows markedly more variability in page length. It is 
also of interest to note the trend toward more empirical research and greater use of 
sophisticated data analysis techniques in secondary dissertations. 


Some General Observations 


The first supplement to the secondary dissertation collection has already been 
completed. Volume 37 of DAI was searched and 87 dissertation located which fit the 
definition of secondary reading presented earlier. Strong research interest in secondary 
reading continues and analysis of the additions to the collection supports many of the 
trends noted previously. Although yearly trends are of interest, it is felt that it would be 
of most value to report a thorough classification and content analysis of secondary 
dissertations about every five years to determine consistency and significant changes in 
overall trends. 


Dissertation titles were found, in general, to be good indicators of content. All 567 
titles were analyzed and non-informative words (articles, prepositions, conjugations, 
pronouns, and auxiliary verbs) were eliminated. This produced an average of six 
informative words per title indicating good descriptive titling for the collection. 


The list of institutions producing ten or more dissertations provides a useful guide 
to colleges and universities where considerable work has been done in advanced study 
of secondary reading. The list of all 115 institutions reporting dissertations is available 
upon request. 


The abstracts reported in DAI were of uneven quality. Others have noted the same 
difficulty in working with collections of dissertations (Boyer, 1972; Brehaut, 1969; 
Gross, 1972). Along with a succinct statement of the problem, authors should be 
particularly careful to include the following in their abstracts: (1) type of research, (2) 
educational domain, (3) data analysis techniques, (4) data gathering instruments, and, 
if applicable, (5) clear description of the independent and dependent variables and the 
results of any statistical analysis. Advisors should take particular care in supervising 
the preparation of the abstract submitted to DAI to insure that adequate information 
content is being reported. 


Glass (1976) suggests that, “The dissertation literature is huge and filled with gems. 
It’s a scandal that few persons other than doctoral students ever enter it” (p 12). 
Empirical evidence also points to the fact that few journal articles are written on 
dissertations (Boyer, 1972; McPhie, 1959) with less than three out of ten dissertations 
exploited for further publication. In this analysis, many interesting, substantive ideas, 
research findings, and the like were noted in dissertations which have never seen the 
light of day in the open literature. Dossick (1972) was correct in his suggestion that, 
“[Dissertations] are conspicuously shelved in the university libraries, to gather the 
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proverbial dust” (p.2).Others, too, have found the dissertation literature to be of great 

value. Yesseldyke and Pickholtz (1975) state, “Perhaps the best indicator of research 

interests and trends in a particular field is a review of theses and dissertations being 

completed for doctoral degrees” (p. 264). Their comment is echoed by Draper (1974), 

Aaa (1974), Palmatier and Rood (1975), and Worthington (1975), to name but a 
ew. 


In conclusion, the points made by Dossick (1972) can summarize our attitude after 
working with the extensive collection of 567 dissertations completed in the field of 
secondary reading from 1960-1976. 

[Dissertations] contribute much to knowledge because of the highly specialized 
character of the data, the results of minute research under expert guidance, and because 
of the wide use of primary sources, experimental investigations, statistical information, 
etc. .. .Over and above much of the pedestrian and at times, mediocre, there remains a 
huge vast storehouse of valuable research which represents a “frozen asset” of data ready 
to be tapped like a rich vein in a mine. (p. 2) 


Appreciation is expressed to Sharon F.M. Jeroski for use of her M.A. thesis data. 
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Use and Perceived Effectiveness 
of Occupational Information 


The use of occupational information and its effectiveness as perceived by 
counsellors and guidance personnel in Alberta secondary schools is limited. 
Research is reported which shows the type and degree of limitations. Counsellors 
appear to have difficulty relating to students about occupational information 
because of their own lack of practical experiences in the world of work. The 
perceived role of the counsellor does not give a high priority to use of 
occupational information. Administrative problems, the school’s philosophy and 
attitudes seem to be obstacles in promoting effective use of occupational 
information. 


Occupational information is needed by students in secondary schools for effective 
career preparation and planning. Unfortunately, in many schools, adequate 
occupational information is not available and, in a number of schools,is not effectively 
used when it is available. Sinick (1970) indicates that guidance workers themselves all 
too often raise questions about the validity and utility of the dissemination of 
occupational information to aid them in carrying out one of their responsibilities. Years 
ago, Shartle (1961) advocated that training in the use of occupational information be 
considered just as important as training in any other personnel or guidance procedure. 
In terms of importance, Shartle said, occupational information is frequently passed 
over as something anyone should be able to understand and use. This situation does not 
help students who wish to learn about their possible career interests and options. 

Mott (1971) found that occupational materials were available in the school for 
teachers’ use, but apparently counsellors were not assisting teachers in developing 
strategies for their most effective use. 

Smith and Cadera (1978) indicated that one of the problems inherent in career 
development approaches is oversimplification of the discrimination of occupational 
information as a way of sharing vocational decisions. It appears that many times 
occupational information is given to students without adequate explanation or 


Dr. Young (PhD, Ohio State University) has been a secondary school teacher of industrial arts in Colorado, a vocational 
teacher in Ghana, and a university professor at Ohio State University, Oregon State University, and Ahmadu Bello University 
in Nigeria. He has served as advisor and consultant in Nigeria, and travelled in 40 countries. His academic achievements 
include research and publishing in the area of Industrial and Vocational Education in Canada, the U.S.A., and overseas. 
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This is reflected in Clark’s (1975) study in regard to the amount of time guidance 
counsellors spend dealing with career information. Only 19.6 percent of guidance 
counsellors’ time was spent on career information. 


This is a report of a research project aimed at determining the use and perceived 
effectiveness of occupational information in Alberta secondary schools. The study was 
conducted by the author and graduate students at the University of Alberta in the 
Department of Industrial and Vocational Education during the period of 1977 - 1979. 


Objectives of the Study 


The major purpose of the study was to obtain information across the province about 
the use of occupational information and its perceived effectiveness in the secondary 
schools. 


Specific objectives included the development of an instrument, administration of 
the instrument, data collection, and analyzation and preparation of a final report. 


Method 


The term “occupational information” as defined by Norris, Zeran, and Hatch (1965) 
was used as a guide for this study. They defined occupational information as valid and 
usable data about positions, jobs, and occupations, including duties, requirements for 
entrance, conditions of work, rewards offered, advancement pattern, existing and 
predicted supply of and demand for workers, and sources for further information 
(p.22). 

Occupational information for the purposes of this study was defined much more 
naturally. It was comprised of two aspects: (a) material related to the preparation 
process for entrance into the world of work (i.e., data regarding training and 
educational programs leading to specific occupations) and (b) material which 
concentrates on the specific jobs and which is potentially useful to a person considering 
these jobs. 


Population. In view of the obvious need and importance for occupational 
information by secondary school students in career planning, it was decided to 
administer the instrument to all secondary schools in Alberta. The Alberta 
Department of Education provided a list of schools offering secondary education. 
These included schools of grades K-12, 7-12, 10-12, and composite high schools. In part, 
the Alberta schools which made up the population may be considered a sample of 
similar schools across Canada. 


The research instrument was developed after conducting both a literature survey of 
similar instruments and a series of interviews with educators involved in career 
planning. After conducting local pilot studies, the final draft of the instrument was 
mailed to all the secondary schools in Alberta. 


The instrument was made up of five sections dealing with the following: general 
information, resource materials, methods of providing information, sources for 
obtaining materials, and services which might be improved to facilitate career guidance 
in the secondary schools. 


Data Collection. Questionnaire data were coded, compiled, and entered on an 
80-item data sheet to be transferred onto key punch cards. A computer program was 
prepared to analyze data from the returned questionnaires. Of the 265 mailed out, 225 
were returned. Of these, 10 could not be used since they were not included in the target 
population. Thus, 215 instruments (81% ) provided usable data. 
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Results 


General information from the instrument indicated that 33% of schools were 
comprised of grades K-12; 26%, combined junior-senior high schools grades 7-12; 
20%, senior high schools grades 10-12; 11%, composite high schools; and 10% were 
classified as others. 


The data indicated that 39% of the schools were situated in towns, 31% in cities, 
and 30% in villages. The student enrollment varied greatly from the rural areas to the 
cities. 

The type and number of personnel dealing with occupational information was not 
consistent from school to school. The data showed that 53% of the schools had one or 
more counsellor and guidance personnel; 5% were sharing a counsellor and guidance 
person with at least one other school; and 38% had a teacher serving as a counsellor and 
guidance person on a part-time basis. Only 43% of the schools contacted reported that 
their school district employed a central counsellor and guidance administrator. 


Resource Materials 


Table 1 provides percentages of utilization and perceived effectiveness of resource 
materials in accessing occupational information. 

Calendars from postsecondary institutions were reported to be the most widely 
used and effective resource material. Career monographs, apprenticeship publications, 
and other professional publications ranked as the next most used material. 

Career information kits were rated higher in terms of perceived effectiveness than 
in utilization, an indicator that this type of material may have greater potential. 

The Canadian Classification and Dictionary of Occupations (CCDO), industrial, 
trade and business publications received a low rating of usage and effectiveness as a 

‘resource material. 


TABLE 1 


PERCENTAGES OF UTILIZATION AND PERCEIVED 
EFFECTIVENESS OF RESOURCE MATERIALS 
IN ACCESSING OCCUPATIONAL INFORMATION 


Wheabits Perceived 
Sue toile zation Effectiveness 
Apprenticeship Publications 77 78 
Calendars: Postsecondary institutions 94 90 
Canadian Classification and Dictionary of Occupations 50 Sy 
Career Information Kits 61 74 
Career Monographs 86 86 
Industrial Publications 42 BZ 
Professional Publications 66 68 


Pe le ek Te ep is a Be eee ee eS eS 
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Methods of Providing Occupational Information 

The percentages of utilization and perceived effectiveness of the methods of 
providing occupational information are shown in Table 2. 

Of the methods providing occupational information, individual interviews, 
displays and posters ranked highest. Guest speakers, vocational interest inventories, 
and work experience were also ranked high. Audiovisual materials, student interviews 
with employers, field trips and career days ranked low in utilization but were perceived 
to be an effective method of providing occupational information. 

The Alberta Department of Education’s explorations in career planning and work 
study received the lowest rating. Reasons for this low rating are not clear. 


TABLE 2 


PERCENTAGES OF UTILIZATION AND PERCEIVED EFFECTIVENESS 
OF METHODS OF PROVIDING OCCUPATIONAL INFORMATION 


Method Utili- Perceived 
zation Effectiveness 
Audiovisual Materials 55 73 
Career Days 40 55 
Displays and Posters 95 87 
Explorations in Career Planning: (Alberta Department) 33 41 
Field Trips 41 65 
Guest Speakers 74 84 
Individual Interviews 87 90 
Student Interviews with Employers 48 64 
Vocational Interest Inventories US) 74 
Work Experience 71 13 
Work Study 31 36 


Sources of Occupational Information 

Of the eight sources listed in Table 3, Educational Institutions were rated the 
highest, with 92% of the respondents indicating utilization and 76%, perceived 
effectiveness. The discrepancy might indicate that this source of information is being 
either over-utilized or misused. 

Ranked next are: Federal Canada Manpower Centres; Industry, Trade and 
Business; the Alberta Department of Education Counselling and Guidance Branch; 
Professional Organizations; and Provincial Career Centres. School Boards and the 
Chamber of Commerce were rated lowest. 
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TABLE 3 


PERCENTAGES OF UTILIZATION AND EFFECTIVENESS 
OF SOURCES OF OCCUPATIONAL INFORMATION 


lL 


Source of Information Utilization Effectiveness 
Alberta Education Counselling and Guidance Branch 54 EY) 
Chamber of Commerce 14 21 
Educational Institutions 92 76 
Federal Canada Manpower 69 61 
Industry, Trade & Business 59 58 
Professional Organizations Sul 55 
Provincial Career Centers 46 45 
School Boards 30 30 


ee 


Services to Facilitate Career Guidance 


Counsellor and Guidance Personnel Training. Concerning counsellor and 
guidance training, 87% of the respondents felt that personnel who provide students 
with occupational information need special training; 10% felt that personnel do not 
need training; and 3% did not respond. As to the value of a university or college course 
in training personnel to use available information effectively, 52% felt this would be of 
great benefit; 44% , of some benefit; 2% , of no benefit; and 2% , no opinion. 

The respondents were then asked to list areas of training they perceived as being of 
importance. Their comments were compiled under the headings: Information Services; 
Testing Expertise; Counselling Practicum; Career Development Theory; and other 
formalized Training. 

Counsellor and Guidance Personnel Experience. Regarding kinds of experiences 
other than formal courses that could be made available to counsellor and guidance 
personnel, 89% included Inservice workshops or Seminars; 48% listed Up-grading 
Courses; and 45% valued Work Experience. 

The respondents were then asked to list those types of personal experiences they 
perceived as being applicable to career guidance. Their comments were listed under the 
headings: Practical Experience in Various Fields; Knowledge of the Local Employment 
Situation; Experiences within the Educational System; Visitations; and Other 
Relevant Experiences. See Table 4 for respondents’ ratings of services to facilitate 
career guidance. 

Adequacy of Existing Programs. The data indicated that 50% of those polled felt 
that their school was providing an adequate program in helping students form realistic 
concepts of the world of work; 50% felt that their program was inadequate; and 10% 
did not respond. 

Those who responded negatively about their school’s career guidance program were 
requested to indicate the problematic areas in their own situation. Their comments 
were categorized under the headings: Administration; Time Restrictions; Finances; 
Staffing; Teaching Attitudes; Student Attitudes; Counsellor Training and 
Experiences; Informational Services; and Present Career Guidance Methods. 
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TABLE 4 


SERVICES TO FACILITATE CAREER GUIDANCE 


Facilitating Service Rating in Percent 


Counsellor Training 


Special training required 87 
Training not required 10 
No response 3 


Postsecondary Course in Using Information 


Of great benefit 52 
Of some benefit 44 
Of no benefit 2 
No opinion 2 


Counsellor Experience 


Inservice workshops or seminars 89 

Upgrading courses 48 

Work experience 45 
Conclusions 


Counsellors and guidance personnel rely a great deal on institution calendars and 
career monographs as major resource material about occupations. Apparently, they 
also feel this is an effective method of providing occupational information for students. 
Regarding methods of accessing occupational information, individual interviews, 
displays and posters were popular. 

It appeared that counsellor and guidance personnel value professional training and 
experience in a positive manner. They particularly identified the need of becoming 
more knowledgeable about occupational information. 


The availability of occupational information was not viewed as a severe problem. 
The main concern seemed to be the utilization of information, interpretation and 
effective distribution. 


The lack of practical experiences in the world of work on the part of counsellor and 
guidance personnel was obvious. Many indicated a desire for a variety of first-hand 
experiences in the world of work in order to communicate a more realistic occupational 
picture to students. They also desired a greater familiarity with both present 
employment opportunities and future projections of the labour market. 

Administrative problems such astimetabling, staffing, and finances seem to reflect 
the individual school’s philosophy towards the importance of occupational guidance. 
Also, teacher and student attitudes were viewed as further sources of problems in the 
effective treatment of occupational information. 
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Human Relations Training — 
The Effects of Methodology 


This study examined the effect of training methods on teachers’ satisfaction 
and personal growth. A random sample of 421 teachers in the public schools of 
Minnesota responded to a questionnaire designed to examine: (1) satisfaction 
with methods of instruction used in human relations training, (2) the effect 
training methodology had on personal growth, and (3) training methodology they 
perceived as most effective. Findings revealed that while the lecture method was 
used most extensively, methodology classified as simulation or on-the-job 
training was perceived as most effective in human relations training. 


Few would argue that teachers need to be humane, empathetic, and accepting 
in the classrooms. The need for educators to acquire training to develop more 
meaningful human relationships is well documented (Rogers, 1969; Jerslid, 1955; 
Dinkmeyer, 1971). There is considerable evidence that human relations training 
increases the effectiveness of teachers. Positive correlations have been found 
between the achievement levels of students and the level of understanding, 
warmth, and empathy displayed by teachers (Aspy, 1969; Kratochvil, Carkhuff & 
Berenson, 1969). Teachers who have had human relations training adapt better to 
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the classsroom setting (Hicks, 1975) and are more skillful in the areas of 
communication, problem solving, and decision making (Jerrems, 1971). It is also 
important to note that training in human relations increases teachers’ facilitative 
levels of communication and allows them to shape their perceptions and insights 
with respect to themselves and others (Israel & Savitsky, 1972). It appears that 
students can benefit through improved learning environments from teachers who 
have been trained in human relations. 


Although there are many studies dealing with the outcomes of human 
relations training, few focus on comparison of methods (Levin & Kurts, 1974) and 
it has not been established what element of the training causes behaviour change 
(Hipple, 1970, 1973). The trend, according to Landy and Trumbo (1976) is toward 
researching which techniques do work, not which methods are most effective for 
achieving objectives. 


The purposes of this study were to gather data relative to techniques 
presently being used in human relations training for teachers and to compare the 
effects of training methods on teachers’ satisfaction and personal growth. 


Method 


The study was conducted in the public schools of Minnesota during the 
1977-78 school year. As the result of a questionnaire survey of all fifty states, it 
was determined that Minnesota’s human relations training mandate offered the 
most comprehensive list of requirements and was representative of other states. A 
total of 1263 questionnaires were mailed to 57 randomly selected attendance 
centers. Forty of the 57 (70%) attendance centers participated in the study. Of 
the 488 questionnaires returned, 67 were completed by Minnesota teachers who 
had Lifetime Teaching Certificates. Since these 67 were exempt from the human 
relations training mandate, they were deleted from the study, leaving 421 (33%) 
usable questionnaires. More than half of the teachers who participated in the 
study were employed in grades 7-12, 57 percent were females, and the majority 
were between 20 and 34 years of age. Approximately eight of every ten 
respondents had received their human relations training from a four-year college. 


Procedures 

A cross sectional questionnaire was used to collect data from teachers. Twenty 
instructional methods highly related to the classroom environment were selected 
and grouped into three main categories: (1) information presentation, (2) 
simulation, and (3) on-the-job training. Teachers were provided a supplementary 
sheet describing each teaching method as well as a list of the 26 components 
mandated in the Minnesota program. For each of the 26 components, teachers 
were asked to indicate: (1) the method of instruction used to “teach” the 
component, (2) their satisfaction with the method of instruction, (3) the effect the 
training had on their personal growth, and (4) which method of instruction would 
be most effective for teaching that component. A rating scale of one to seven was 
used, with one representing a low and seven a high degree of satisfaction or 
personal growth. 

Analysis of variance was used to examine the effects of method of instruction 
upon teacher satisfaction and personal growth. Duncan’s Multiple Range Test was 
used to determine where significant differences existed. Significant differences are 
reported at the highest level, i.e., either .05 or .01. 
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Results 


The study had two major purposes: (1) to determine teachers’ satisfaction 
with methods of teaching used in human relations training, and (2) to determine 
the effect of methods of instruction on the personal growth of teachers. 


Mean scores for teacher satisfaction and effects of training may be seen in 
Table 1. Teachers expressed greatest satisfaction and reported greater personal 
growth with “other” as a teaching method. It should be noted the method “other” 
was used infrequently (.03%). They were least satisfied with programmed 
instruction, while reading lists provided the least personal growth. Lecture ranked 
sixteenth for providing satisfaction and seventeenth for promoting personal 
growth. Of the nine methods classified as simulation and on-the-job training, none 
ranked lower than twelfth for either dimension. 


TABLE 1 


RANK OF MEAN COMPARISON OF TEACHER SATISFACTION 
AND EFFECTS OF PERSONAL GROWTH 
WITH METHOD OF INSTRUCTION 


Rank of Degree of Satisfaction Effect upon Personal 
Methods (WV = 10,641) Growth (WV = 10,560) 
i Other 5.260 Other Si6 87) 
pas Case method 5.144 In-basket technique 5.425 
3 Task-centered games 4.807 Job rotation yoda Lil 
4 Task model 4.854 Case method 5, 082 
Dis Systematic observation 4.833 Programmed instruction 4.930 
6. On-the-job coaching 4.640 Task-centered games 4.843 
7 Committee assignment 4.582 Task model 4.667 
8 Job rotation 4.500 Committee assignment 4.665 
9. Conference-seminar 4.438 Role playing 4.410 
10. In-basket technique 4.367 Systematic observation 4.392 
HM Ke Role playing 4.340 On-the-job coaching 4.252 
Love Laboratory education AN SSS \7/ Incident method 4.220 
13's Reading list 4.259 Conference-seminar 4.005 
14. Incident method 4.250 Laboratory education 3.867 
Sie Sensitivity training 4.219 Sensitivity training 3.664 
16. Motion picture 4.116 Lecture 3.380 
ee Lecture SVS Motion picture SHS U7) 
er Closed circuit TV 3.588 Training by correspondence 3.075 
Lon Training by correspondence 3.333 Closed circuit TV 2.983 
20. Programmed instruction 36234 Reading list 2.921 
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ANOVA results shown in Table 2 indicate that method of instruction affects 
teacher satisfaction with human relations training. Satisfaction with methods 
differed significantly (p><.01). The Duncan revealed that there was no significant 
difference between simulation and on-the-job training but that both were superior 
to lecture and information presentation (p<.01). Lecture was reported as less 
satisfying than information presentation (p<.01). 


TABLE 2 


ANOVA SUMMARY TABLE FOR TEACHER SATISFACTION 
WITH METHOD OF INSTRUCTION 


Source af SS MS F 
Between groups 3 1,094.79 364.93 110.71%* 
Within groups 10,595 34,924.91 3250 
Total 10,598 36,019.70 


* Highly significant: .01 level. 


Table 3 shows that there was a highly significant difference (p<.01) relative 
to the effect method of training had on the personal growth of teachers. 


TABLE 3 


ANOVA SUMMARY TABLE FOR EFFECTS OF METHOD 
OF TEACHING ON PERSONAL GROWTH 


Source af SS MS F 
ee a ee Ee eee 
Between groups 3 PAP HAO LAL 756.87 Pata Westies 
Within groups 10,510 35,859.40 cial 

Total LO FSUS 38,130.00 


te) Serrilea)) Bolvsded) to eiwitewiiy 9) ee ee Ee ee 


* Significant: .01 level. 
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The Duncan revealed that the four methods all differed significantly from 
each other (p<.01). Simulation was reported as providing the greatest personal 
growth, followed by on-the-job training, information presentation, and lecture. 


Discussion 


The findings indicate that teacher satisfaction and personal growth are 
effected by the method of instruction utilized in human relations training. 


Teachers expressed more satisfaction and reported greater personal growth when 
methods classified as simulation or on-the-job training were used. This supports 
the conclusions reached by Argyris (1962), Schein and Bennis (1965), and Hipple 
(1973), although their research was not conducted with teachers. 


While simulation and on-the-job training were identified as the most effective 
methods for human relations training, lecture, the method used most extensively, 
was rated as least satisfying. The teachers also identified lecture as the method 
least effective for contributing to personal growth. 


The case method, in-basket technique, task centered games, and task model 
appear to be choice methods of instruction since they rate high for teacher 
satisfaction and personal growth. The method “other” was not considered in 
hypotheses testing since only a small number reported this method of instruction, 
but it had the highest ranking for both satisfaction and personal growth. Since 
those who reported it as a method of instruction failed to specify what the other 
methods were, it was impossible to draw any further conclusions. This is an area 
ripe for further study. 
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PERSPECTIVES 


Teachers’ Opinions and the 
Ideational Basis of Education 


Peter J. Miller 
The University of Alberta 


Among the more important, if little understood, determinants of the character of 
the school experiences of children are the educational opinions of their teachers. Such 
opinions do more than provide a rationale, perhaps even a justification, for a particular 
set of educational purposes and procedures. In determining the limits of the 
educational alternatives and possibilities envisioned by or acceptable to an educator, 
they act as an important causal factor in the initial selection of these purposes and 
procedures. They constitute, in fact, an ideology within which an educator is forced to 
think and act. Without an adequate understanding of the structures and character of 
teachers’ educational opinions, no full comprehension of the present character or 
possible future directions of education is possible. 


Educational opinions are, of course, uniquely personal and idiosyncratic, occurring 
in complexes that vary not only in content but in intensity. Attitudes and opinions, 
however, can be abstracted from the personalities of which they are a part and 
expressed in formal statements. They can thus be objectified and subjected to analysis 
and categorisation into clusters which indicate general orientations towards education. 
Some of these sets of opinions are so coherent that they can be expressed in a series of 
logically related statements from which can be derived specific recommendations 
concerning educational goals and practices. They can thus not inappropriately be 
referred to as educational theories. One can, then, using educational opinions as a basis, 
construct a typology of educational orientations and of educational theories current 
among prospective and practising teachers. 


This paper will develop a conceptual framework or matrix within which various 
categories of educational opinions can be identified. It will then report and comment on 
the results of research carried out at the University of Alberta designed to discover and 
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demonstrate the degree of affiliation among prospective and practising teachers to the 
orientations and theories identified. 


Two Dimensions of Educational Opinion 


There are many attitudes and opinions held by educators, aspects of which have 
relevance for their performance as teachers. Of special significance, however, are those 
which give expression to basic beliefs about the nature of the child and its relationship 
towards authority. These beliefs, although numerous and complex, are isolable and can 
be categorised into two opposing clusters, which in their antithetical relationship 
comprise a single dimension of educational opinion. At the centre of one cluster is the 
belief in the innate goodness of man and thus in the rightness or appropriateness of 
instinctive or impulsive forms of behaviour. Associated with these basic beliefs are a 
series of educational slogans proclaiming the desirability of open learning 
environments, play activities, motivational needs, freedom of expression, and the 
child’s right to make choices. The process and task of education is thus defined as one of 
making manifest whatever is latent within a child. The antithetical cluster has at its 
heart the denial of any innate goodness in man and consequently stresses the 
importance of acquired behaviours. It has thus spawned its own set of educational 
slogans stressing the importance of structured learning situations, work activities, 
prescriptive needs, the child’s right to receive what adults determine is appropriate, 
and, above all, discipline and control. It thus rejects the view of education as 
manifestation and views it instead as a process of acquiring knowledge and behaviour 
which are external to the individual. The two clusters and the dimension of 
educational opinion they define may be characterised by the terms “child-centred” and 
“teacher-centred.” The educational dualism they represent is a familiar one to 
educators and can be traced, in its present form, back to the educational literature of 
the mid-eighteenth century. The persistence with which educators over the past 250 
years have continued to express their differences in the terms of this dualism suggests 
that its treatment as a primary dimension of educational opinion is justified (Dewey, 
1902, pp. 3-10). 

Educational opinion, however, splits on dualism other than that of “traditional” 
teacher-centred beliefs as opposed to the “new” child-centred ones, and on none more 
profoundly and significantly than that concerning the nature of knowledge and related 
questions such as the status of science and scientific inquiry. Indeed, a teacher’s beliefs 
concerning the nature of reality, of truth, of knowledge are as fundamental to the 
formulation of educational purposes and procedures as those concerning the nature of 
childhood. These opinions, expressed in formal statements, can also be grouped into 
two clusters of related beliefs which comprise a second dimension of educational 
opinion. The clusters and the general orientations they represent may be designated 
“toughminded” and “tenderminded.”! The discrete yet logically related elements of 
toughmindedness include a commitment to materialism and an associated embracing 
of empiricism and determinism. Objectivity is thus viewed as a necessary and desirable 
characteristic of modes of inquiry, of knowledge, and of professional relationships. 
Control over the emotions (particularly over those of love and hate), dispassion, 
detached judgment — these are the hallmarks of understanding, of correct decisions, of 
appropriate behaviours. The principal elements of tendermindedness, on the other 
hand, are antithetical preferences for transcendentalism, intuitionism, free will, and 
subjectivity both in knowing and in determining the nature of human relationships. 
Passionate involvement and freeflowing emotions are seen not as barriers but as 
avenues of truth, knowledge and right action. The dualism represented by these two 
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clusters of opinions is one of the oldest in the western intellectual tradition. It can be 
seen in one form in the differing kinds of knowledge and truth pursued in Plato’s 
Academy and Aristotle’s Lyceum, and is still evident in the inability of C.P. Snow’s two 
cultures to understand each other fully. Whatever the origins of the dualism, it is clear 
that the educational issues raised by the conflict between its elements are fundamental 
ones, and perhaps even more generative of educational differences at a theoretical and 
practical level than the previous one. 


A Matrix of Educational Opinion 


While, in one sense, the two antithetical clusters of opinions which comprise a single 
dimension are logically discrete, it is possible to conceptualise them as representing the 
extremes of a single range of possible sets of opinions. Between extreme 
child-centredness and extreme teacher-centredness, for example, there are an 
indefinite number of intermediate positions, each representing a certain degree of 
inclination towards one or the other. The dimension can, in fact, be represented by a 
single scale of educational opinion. 

The two scales and the dimensions of educational opinion they represent are, as 
has been suggested above, theoretically independent. This assumption of an 
orthogonal relationship between the two dimensions makes possible the creation of a 
symmetrical matrix within which various categories of educational opinion can be 
established.? The particular character of an individual’s educational opinions can, in 
fact, be established by determining the appropriate point on each of the two scales 
and plotting the resultant position within the matrix. And while there are an 
indefinite number of positions that can be occupied within the matrix, it is also the 
case that certain of these share a common area of the matrix and can thus be 
categorised. Moreover, if within a general population the distribution of scores 
within each scale is conceived of and represented as a normal curve, it is possible not 
only to categorise opinion but also to determine the relative frequency with which 
each category will occur within this general population. These categories of 
educational opinion and their relative popularity are depicted in Figure 1, where S 
represents one standard deviation from the mean on a normal curve. 

Within a general population, then, some 46% will possess no strong educational 
opinions as defined by the matrix. Such a category will presumably be comprised not 
only of those who are indifferent to the appeal of the slogans inherent in each set of 
clusters, but also of those who have succeeded in combining in a unique personal belief 
system elements from the antithetical clusters of each dimension. There is, of course, 
no logical reason why a personal educational belief system should be constructed 
according to the logic and particular structure of the matrix described above. 
Nevertheless, in terms of the matrix, the belief systems of this group may legitimately 
be said to be characterised by indifference or confusion or both. 


Forty-four percent of the total population will demonstrate a strong set of 
preferences along one of the dimensions but show no strong commitment along the 
other. This particular type of belief system is referred to in Figure 1 as an educational 
orientation. Thus, the freedom orientation, authoritarian orientation, science 
orientation, and humanistic orientation will each contain approximately 11% of the 
total general population. 


The remaining 10% will show a strong commitment along both dimensions and 
thus be categorised into one of the four theoretical affiliations identified in Figure 1. 
The association of child-centredness with toughmindedness is characteristic of 
affiliation to a theory of pedagogical naturalism as conceived and expounded by 
Rousseau, Dewey, and Piaget. Child-centreness combined with tendermindedness can 
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be categorised as educational romanticism as expressed in the literary works of William 
Blake and Charles Dickens, by modern client-centred therapists like Roger and Laing, 
and by the neo-Freudian freeschoolers, Homer Lane and A. S. Neill. Commitment to 
teacher-centredness and toughmindedness are at the heart of the educational 
recommendations of nineteenth century utilitarians like James Mill and Jeremy 
Bentham, and of twentieth century behaviourists like J.B. Watson and B.F. Skinner. 
Finally, tendermindedness combined with teacher-centredness represents an orthodox 
religious view of the child and his education, as expressed, for example, by Jewish, 
Roman Catholic, and Christian fundamentalist educationists. This category is referred 
to as a Christian humanistic affiliation. We may expect to find, however, only about 
2.5% of the general population possessing an affiliation for any one of these general 
theories. 
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Figure 1. Category distribution of educational opinions within an ideal type general 


population. 
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It is important, at this point, to make a further distinction between an educational 
orientation and affiliation to an educational theory. The former term refers to sets of 
opinions which are essentially one-dimensional — that is, which reflect a strong 
commitment within one dimension but little or no preference within the other. Thus, to 
subscribe to the beliefs associated with child-centredness with no corresponding 
commitment to either tough or tendermindedness constitutes an educational 
orientation. While such an orientation is important in setting certain limits to a range 
of preferred educational goals and practices, it does not in itself contain any clear and 
unequivocal educational implications. It will, rather, tend to generate a set of 
educational slogans, empty of precise meaning, specifying no education particulars 
(Komisar & McLellan, 1965, pp. 200-1). Affiliation to an educational theory, on the 
other hand, refers to those sets of opinions which reflect a strong commitment along 
both dimensions. In such cases, concepts acquire a meaning precise enough to generate 
clear and coherent educational implications. Thus openness in education (a preference 
clearly associated with child-centredness) remains a mere slogan until it is associated 
with either tough- or tendermindedness and becomes an element in a general theory of 
Naturalism or Romanticism. In the former case, it will express itself in the kind of 
learning environment constructed by Dewey in his Chicago experimental school; in the 
latter, it will imply the kind of learning situations that arose in A.S. Neil’s Summerhill. 
Similarly the idea of progress (a concept inherent in toughmindedness) has no precise 
meaning or educational implications until associated with child-_ or 
teacher-centredness — until it becomes, in fact, Dewey’s evolutionary based notion of 
growth or the inevitable result of Skinnerian social engineering. 


Within a general population, then, we may expect to find a relatively large number 
(46%) of individuals who have no strong educational orientation or theoretical 
affiliation, an almost equally large number (44%) who possess a clear educational 
orientation, and a relatively small number (10% ) who are affiliated to a comprehensive 
educational theory. Of course, in the real world, there is rarely, if ever, to be found an 
ideal type general population which can be categorised so neatly, precisely, and 
symmetrically. Nevertheless, a concept of such an ideal type can function as an 
important frame of reference for an analysis of the opinions of a particular population 
group — for example, the teaching profession. 


Ideational Structure and Teachers’ Opinions 


Among the several important population groups whose opinions, as it were, lie 
behind and affect educational practice, that of prospective and practising teachers 
must be considered of special significance. Not only does this particular population 
control the day to day practice of education; equally important, in view of its training 
and experience, we might, resonably expect that its knowledge of the contemporary 
educational process would be more intimate and reliable than any other group, its 
understanding of the theory underlying the practice more sophisticated, and its 
opinions better informed. Indeed, as a source of insight into the present character of 
public education; the educational opinions of teachers, both prospective and 
practising, must be considered primary data. In ascertaining and considering these 
opinions, it is important to make a distinction between those beliefs which may be 
said to reflect a personal stance towards education, and those which are attributed by 
the individual to the educational organisation or institution within which the 
individual is to operate. The former is a view of the ideas which it is believed ought 
to govern educational practice, the latter a perception of its actual theoretical 
underpinnings. Both types of opinion, however, are important in revealing 
something of the ideational basis of contemporary education. 
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In an attempt to discover something of the nature of both types of opinions, and to 
compare in this respect the teaching profession with an ideal type general population, a 
questionnaire was designed utilising the conceptual model described above to providea 
measure of prospective and practising teachers’ preferences along each of the two 
dimensions of the matrix (Miller, 1974). The resulting scores located the belief system 
of a particular individual at a point within the matrix and thus enabled its 
categorisation into one of the nine types discussed above. In addition, each respondent 
was also asked to give his or her opinion of the position adopted by the system of 
education towards each of the questionnaire items. Thus, a further measure was 
obtained of teachers’ perceptions of the ideational foundations of actual educational 
practice. The questionnaire was administered to 520 prospective and practising 
teachers registered in courses at the University of Alberta. Several analyses of data 
were carried out and several significant results obtained. 


To begin with, consider the category distribution of teacher personal belief systems 
illustrated in Figure 2. There are two observations about the category cell distributions 
illustrated in Figure 2 which are of particular significance. The first concerns the 
disproportionately large number of respondents (58%) who demonstrate no 
orientation or theoretical affiliation. The most important factor inflating the number 
in this category is clearly the relative weakness of the science (1.7% ) and humanistic 
(7% ) orientations.’ Both are significantly underrepresented, the former markedly so. 
One can only speculate about the reasons why the alternative views of truth and 
knowledge represented by the tough/tenderminded dualism possess so few strong 
adherents among teachers. It may be, of course, that the relatively low standard 
deviation within this scale (7.5 as opposed to 9.9 on the child/teacher-centredness 
scale) is a function of conceptual confusion among the questionnaire items. It seems 
more likely, however, that it is a reflection of a comparatively weak interest among 
teachers in epistemological questions and a corresponding strong interest in issues that 
appear to be methodologically relevant (Miller, 1976). Whatever the causes, when one 
considers the teaching profession’s lengthy period of training and its primary role in 
the educational enterprise, this relatively low level of ideational or theoretical 
commitment among teachers is somewhat disappointing if not disconcerting. 

The second observation is not quite so obvious but perhaps more interesting and 
certainly more revealing of the structure of contemporary educational theory. It will be 
noted that despite the relative weakness of the science (1.7%) and freedom (4.8% ) 
orientations, the theoretical affiliations to Behaviourism (3.3%) and Romanticism 
(4.8%) are disproportionately strong. What is at work here is a powerful association 
among teachers of two logically discrete clusters of beliefs, the preference for 
toughmindedness being linked with teacher-centredness, that of tendermindedness 
with child-centredness. Put simply but not incorrectly, science is seen as being tied to a 
control orientation, while a humanistic orientation is seen as liberating. Conversely, the 
virtual complete absence of a Naturalistic affiliation (0%) and the relatively low 
representation of a Christian Humanistic affiliation (1.7 % ) indicate a related tendency 
to disassociate the science orientation from a freedom orientation, the control 
orientation from a humanistic orientation. In the current climate of education opinion, 
it appears that the major theoretical affiliations of educators are to Behaviourism, with 
its apparent commitment to the scientific study and treatment of man and its support 
for such values asexternal control, achievement, conformity; or to Romanticism, with 
its antithetical commitment to the subjectivity of knowledge and the values of 
self-discovery, self-expression, self-fulfillment. Even if there is an awareness of, there is 
certainly no belief in the alternatives offered by Naturalism, with its Deweyan 
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commitment to freedom through participatory democratic decision-making and 
liberating objective scientific inquiry, or by Christian Humanism, which insists upon 
the necessity of self-control, based essentially on self-knowledge and an inherently 
pessimistic, if not tragic, view of social life.‘ 
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Figure 2. Category distribution of teachers’ educational opinions. (Total n=520) 


It has been noted that a disproportionately large number of teachers show no strong 
orientation or affiliation and that those who do have clear theoretical commitments are 
attracted mainly to Behaviourism or Romanticism. These findings, however, concern 
only the personal belief systems of teachers; they tell us nothing of their opinions 
concerning the ideational foundations of the actual practice of education as they have 
experienced or, indeed, conduct it in their classrooms. The category distribution of 
teachers’ opinions concerning the beliefs underlying the system of education is given in 
Figure 3. 


Once again, two principal observations may be made about this distribution. First, 
there are proportionately very few teachers (14.6% ) who view the system as lacking a 
strong ideational basis. Whatever the strength and coherence of their own educational 
beliefs, they perceive no lack of either in the theoretical foundations of the educational 
practice approved by the system they are involved in. 
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Figure 3. Distribution of teachers’ perceptions of the ideational foundations of the system of 
education. (n=520) 


Secondly, there is clearly evident a high degree of agreement concerning the actual 
nature of this strong ideational underpinning of the educational system. A high 
proportion of teachers view current educational practice as based either upon a control 
orientation (51.1%) or upon an affiliation to to Behaviourism (26.7 % ). The other three 
orientations and three affiliations account for only 7.5% of responses. That teachers 
should view education within a system of compulsory schooling as reflecting a control 
orientation is hardly surprising. What is more interesting and significant is the 
strength of the perceived theoretical affiliation of the system. Once again, there is 
apparent an association, in this case an extremely powerful one, of toughmindedness 
with teacher-centredness. Indeed, so strong is the association of science with control 
that the Naturalistic affiliation and science orientation are completely unrepresented. 
The educational system, then, tends to be viewed as a concrete manifestation of 
society’s commitment first to the truths reached by the scientific study of man and 
nature, and second to the maintenance of order and control both within and over the 
individual and society. In this sense, the system is seen to operate in accordance with its 
own internal logic; whatever its faults, it is seen to possess the virtues of coherence, 


consistency and clarity of purpose. 
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The perceived current strength of the control orientation and behaviouristic 
affiliation in education ought not to be surprising and, indeed, is probably no greater 
now than it was when Dickens satirised it so bitterly in his Hard Times almost a 
century and a half ago. From its earliest beginnings, the public school has been linked 
ideologically with the factory system and industrial mode of production, themselves 
dominated by an ideational commitment to science, technology, and social control 
(Miller, 1973). Given a theoretical basis by utilitarian ethical theory in the nineteenth 
century and an experimental foundation and fresh vocabulary by psychologists in the 
early twentieth century, the behaviouristic domination of educational opinion and 
practice has proven to be virtually unassailable. The vision of democracy and education 
held out by Dewey, the much discussed but little tried social and educational ideals of 
progressive education, the eloquent criticisms voiced by the romantic imagination, 
even the various degrees of a Christian presence in public education — none have 
succeeded in weakening what now appears to be perceived as a monolithic ideational 
structure of public education. Nor do the personal educational opinions of teachers 
appear to pose any immediate threat to this ideational foundation of education. 
Indeed, the large proportion (58%) of teachers having no orientation or theoretical 
affiliation, the disproportionately strong control orientation (18.4%), and the 
disproportionately strong association of the concepts of science and control (3.3%) 
suggest that in the foreseeable future, the behaviouristic affiliation will continue to be 
perceived as the dominant force in education. 


In conclusion, it should be noted that, as is always the case in attitude and opinion 
studies, one must be especially cautious about suggesting any direct relationship 
between stated beliefs and actual behaviours. We know little if anything of the 
relationship between category membership and classroom performance, and nothing of 
that between a teacher’s perception of an educational system and the actual theory and 
practice in operation within the system. Nevertheless, despite and in the face of such 
disclaimers, for the few despairing voices that have been raised in the wilderness of 
proliferating behavioural competencies and more refined control techniques, the lack 
of any serious challenge to the dominance of behaviourism revealed by an analysis of 
prospective and practising teachers’ educational opinions presents a most dismal 
prospect. 


Notes 


Vp The opposing sets of beliefs described here have been commented on in a number of contexts. Of particular significance 
for the present study, however, are the distinctions made by William James between the optimism of healthy 
mindedness and the pessimism of the reflective sick soul (James, 1971, pp. 92-171), and that made by P. Sorokin in his 
analysis of the evolution of culture between the “sensate” and “ideational” modes of knowledge and truth (Sorokin, 


1957, p. 311). 
rR See the discussion and application of Lewin’s concept of “phase-space” (Lewin, 1951) by Jackson (1959). 
3 The freedom orientation (4.8%) is also significantly underrepresented, but is more than compensated for by the 


disproportionately strong control orientation (18.4%). Together they comprise over 23% of the total responses, 
compared with 22% of an ideal type general population. 
4, See, for example, the two alternatives offered educators in A Choice of Futures (Worth, 1972, pp. 26ff). 
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Ethics is an aid to moral navigation. We can sail without it if we wish. But we ignore it 
at some peril. (Hodgkinson, 1978, p. 220) 


The few educational administration texts which consider the subject of ethics 
tend to discuss the importance of ethical behaviour rather than to facilitate the 
learning of it. In fact, one looks in vain for the word “ethics” in either the table of 
contents or the index of A Values Approach to Educational Administration 
(Ostrander & Dethy, 1973), a text dedicated to the importance of one’s personal 
value system and philosophic posture in guiding his administrative behaviour, and a 
search of the UCEA Review for the past dozen years turns up little more than some 
passing references to the incorporation of ethics into preparation programs 
(VanGeel & Immegart, 1978; Gibson & Stetar, 1976; McIntyre, 1970; Report on 
Syracuse University’s Seminar, 1970; Nunnery & Kimbrough, 1969; Reference to 
videotaped discussion, 1968). Available literature provides almost no guidance on 
how to prepare educational administrators for ethical practice. 


While the present article is an initial attempt toward filling this gap, it ignores 
some pertinent topics. For example, it does not dwell on the importance of ethics 
to human behaviour generally or to contemporary educational administration in 
particular. The conditions in society that are causing those in the medical, legal, 
business, and other professions to become increasingly concerned with ethics are 
well recognized. Concern in educational administration has been heightened by 
such developments as the stress on accountability and efficiency; the use of 
sophisticated technology in such areas as information control and chemical, 
electronic, psychological, and biological approaches to trait and behaviour 
modification; and the need, as Gross (1978, p. 293) has indicated, for the school, 
its administration, and its teachers to be “models of ethical conduct ” because of 
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the decline in other traditional sources of values such as the home, the church, 
and the community. These concerns require no elaboration for present purposes. 


Nor is there an effort here to distinguish definitionally such related concepts as 
ethics, values, morals, sensitivity, humanism, or philosophy — except to emphasize 
that ethics (like morals) are not identical to values. In this regard, Sergiovanni and 
Carver (1973, pp. 18-22) have built helpfully upon Huebner’s work by presenting a 
taxonomy of values, in which ethical values comprise one of five systems (the other 
four being technical, political, scientific, and aesthetic values). Among others who 
have tried to differentiate between these two concepts, Berkson (1968, p. 51) 
suggests that ethics derive from patterns of values; Blackmon (1977, p. 51) states 
that “ethics are related to a profession as values are to an individual,” with which 
one may disagree in the belief that there is a personal component to ethics that 
transcends any given profession (although the operationally useless “clotheslines” of 
generalities that typically pass for “codes of ethics” indicate that most professional 
associations concur with Blackmon’s view); and Peters (1966) remarks that “what is 
distinctive of ethics as a branch of philosophy is that it is concerned with the 
analysis and justification of answers to practical questions where ‘practical’ is 
contrasted with ‘theoretical’ ” (p. 17). 


Definitions aside, what this article does address are the following three 
questions: 
1. Can and should ethics be included in the formal preparation of educational 

administrators? 

2. To what extent and in what ways do current programs include ethics? 
3. What further development, if any, should be pursued in this area? 
Thus, attention is focussed on the arguments, the realities, and the prospects 
regarding ethics in the preparation of educational administraters. 


The Arguments 


Can and should ethics be included in the formal preparation of educational 
administrators? Accepting the view that ethical competence (the will and ability to 
act professionally in accordance with high moral principles) is essential to 
educational administration today does not necessarily imply a positive answer to 
this question. Several issues must first be carefully considered, and it must be 
recognized that the topic is the development of ethical competence in prospective 
educational administrators — not values clarification (although the process of 
introspection is relevant), not sensitivity training (although human relations skills 
are relevant), and not the study of school law (although the concept of justice is 
relevant). All of these relatively common training components have some 
relationship to the development of ethical competence, but none of them is directly 
addressed to that task. The concern here is with the possibility of something 
different — with a consciously focussed endeavour, during formal preparation, to 
develop ethical competence. 

The first argument to be considered revolves around whether or not the effort is 
worthwhile at all. On the one hand, Donald Thomas (1974) has written: 

It is difficult for men and women to learn to be ethical; it is not a skill to be developed. 
Rather it is the man or woman in total; it is the sum of one’s values, beliefs, and 
priorities. It is the way one accounts for his life and his own worth. The choice is not 


difficult if one is by habit an ethical person. (p. 50) 
Thomas is not alone. Bok (1976) implies that there are many who believe that moral 


development involves acquiring proper moral values and achieving the strength of 
character to put these values into practice, matters that are not easily taught in a 
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classroom; this point of view accounts for the statement of a business school 
spokesman who, in explaining why there were no courses on ethics in his curriculum, 
said: “on the subject of ethics, we feel that either you have them or you don’t” (Bok, 
1976, p. 29). 

The opposing side has been taken by Erwin Miklos (1978), who provides 
examples of circumstances under which administrators must inevitably face ethical 
and moral problems and who calls for training that would help them grapple with 
these problems. Among the circumstances he presents are manipulation, particularly 
through the control of communication networks and the management of information 
flows; deception, which he says “is a component of accepted administrative 
practices” (p. 3); accountability, which administrators may seek to evade; and 
considerations about the very quality of organizational life. He concludes that 
“ethical and moral considerations are more pervasive in administrative 
decision-making action than is reflected by scholarly activity in the field of study 
and by the substance of preparation programs” (p. 3), and he raises the possibility 
that “ethics in administration could become acceptable as an area of specialization 
in the same way as the economics of education or the legal aspects of administration 
are now accepted” (p. 4). 


Particularly convincing to the present author is the position taken by Harvard 
University’s President. In discussing problem-oriented courses designed to 
encourage students to define their moral values more carefully and to understand 
more fully the reasons that underlie and justify these precepts, Derek Bok (1976) 
writes: 

Unless one is prepared to argue that ethical values have no intellectual basis whatsoever, 

it seems likely that this process of thought will play a useful role in helping students 

develop a clearer, more consistent set of ethical principles that takes more careful 
account of the needs and interests of others. And it is also probable that students who 
fully understand the reasons that support their ethical principles will be more inclined 
to put their principles into practice and more uncomfortable at the thought of 
sacrificing principle to serve their own private ends. (p. 29) 
This supposition that ethical values have an intellectual basis is returned to later. 

A second argument, assuming that one does decide to try and incorporate ethics 
into preparation programs, concerns whether one should teach moral absolutes or 
espouse ethical relativism. On the one hand, one finds the view expressed in The 
ATA Magazine (Samoil, 1978) that support for teaching moral absolutes is a 
rejection of complexity in favour of “the comfort and imagined security of a 
unilateral approach to life”: 

Are we prepared to teach that only one set of values is appropriate for everyone in any 

situation? If so, which code of morality is it to be? Who will decide whether the schools 

should teach only moral absolutes and which principles will be taught? 


Is society in such a state of insecurity that it cannot tolerate divergent ideas and values? 
(p. 13) 
The absolutists, on the other hand, reject this “tolerance” school of ethical thought, 
not only because it denies the existence of objective standards of morality, but also 
because they view the relativist line of argument as specious. Michael McGrath 
(1969) puts it this way: 
The position is something like this: ethical relativism means that one cannot say that 
certain human values are objectively superior to others; therefore, my values are not 
objectively superior to those of other men; therefore I am obligated to respect the values 
of all men as much as my own. Although it is not usually stated this explicitly, such an 
argument is clearly fallacious in that the conclusion is inconsistent with the premise. 
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The premise states that no particular values are objectively superior to others; yet the 
conclusion seems to be saying that one value, tolerance, is superior to others. What is 
surprising is how many modern intellectuals accept such an argument and so delude 
themselves as to the consequences of ethical relativism. (p. 49) 
McGrath concludes that, even though ethical judgments may be relational, this is 
not a sufficient reason to deny the possibility of ethical knowledge (p. 52). 


Finally, there is the argument that involves the role of the professor, vis-a-vis the 
students, in teaching ethics. At issue here is the concern that program components 
dealing with moral issues will become vehicles for transmitting the private 
prejudices of the instructor. Harry Randles (1977), in a personal statement prepared 
for the NCPEA Continuing Interest Group on Values and the Humanities, seeks to 
avoid the problem by relying on the notion that his students are individuals who 
“have goals and aspirations which are uniquely their own.” He goes on: “Acceptance 
of them as people implies acceptance of their goals and aspirations. Consequently, I 
feel a strong obligation to assist them to achieve their objectives. . . .I am obligated 
not to have a plan ‘to improve’ them” (p. 13). 


There are at least two difficulties in sustaining this point of view. First, unless 
the instructor completely abdicates his professional responsibility as leader of the 
learning experience, he cannot avoid some degree of “steering,” if only through his 
selection of readings or other content and activities to be included in the learning 
experience. Secondly, if he endeavours to remain completely neutral, he is modelling 
inauthenticity and it would, in effect, be unethical of him to expect his students to 
behave authentically under these circumstances. Thus, his means would be in 
conflict with his ends. Accordingly, if a professor endeavours to develop ethical 
competence in administrator preparation, he should be willing to expose his own 
ethics (which Randles [1977] is prepared to do, but on an “equal” basis). 


The above examples of important arguments surrounding the question of 
whether or not ethics can and should be included in the formal preparation of 
educational administrators are sufficient to demonstrate the complexity of the 
question. The author’s answer to it is that an effort to teach ethics should be made 
— carefully. The subsequent question is: How? Before an answer is attempted, a 
review of what is being done in contemporary practice is presented. 


The Realities 


To what extent and in what ways do current programs include ethics? The 
author recently conducted a survey to determine whether or not major university 
Departments of Educational Administration are offering learning opportunities 
concerned with ethics and, if so, why, what, for whom, how, by whom, and with what 
results? The institutions contacted were those holding membership in the 
University Council for Educational Administration during 1978, on the assumption 
that responses from them would provide a reasonable reflection of the “leading edge” 
in this field. 

Eighteen replies were received from the 48 universities contacted, a response rate 
of 37.5%. One suspects that the vast majority of those who did not reply had 
nothing to say on the subject. Of those that did reply, the comments of seven may be 
summarized as follows: ethics is an important topic and a neglected one; we don’t do 
much, or anything, specifically in this area at our institution. If this group is 
combined with the nonrespondents, it might be concluded that almost 
three-quarters of the universities contacted pay no conscious attention to the subject 
of ethics in their administrative preparation programs. The few reasons given by this 
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group of respondents include a lack of relevant competence by educational 
administration faculty members, and the availability of sufficient ethics-related 
courses elsewhere on campus (notably in Educational Foundations and Philosophy 
Departments, and in Business, Law, and Medical Schools). Other common remarks 
suggest confusion about what is meant by “ethics,” and some concern for ethics in 
training educational researchers (but not educational administrators). 


The remaining eleven responses fall into two basic groups — those reporting 
that the subject of ethics is integrated throughout much of their preparation 
programs, and those reporting distinct program components designed deliberately 
to focus on ethics. The majority (seven) of these are in the first group and most of 
them indicate that, to the extent that ethics are treated in their programs, they are 
included incidentally. The following statement is characteristic of this group: 

I subscribe to the belief that ethical knowledge cannot be taught as can 
computer-programming or time-tabling, since ethics have to do with insight and 
judgment and not with the manipulation of things. It is thus inappropriate to deal with 
ethical issues in “packages, entrance standards, topics covered, main references,” and so 
forth. So far as I can determine, there has been little empirical success to date in the 
field of clarifying values for students or developing them morally — how then, can 
ethics, which is a subject far more complex than a simple question of morality, be dealt 
with by means of courses, in-service training, texts? Ethics is, in essence, expressed not 
simply through dogma but also through being — it cannot be learned through specific 
courses taught in school. The most that formal courses can do is raise an awareness of 
ethical issues. 
Respondents in this category claim that matters related to ethics arise naturally in 
virtually all aspects of their preparation programs wherein the primary focus is on 
some other subject; ethical concerns intrude during practical or clinical experiences, 
during discussion of cases and simulations, and during regular courses through 
considerations such as ethical responsibilities in the supervisory relationship, ethical 
uses of personal data, and ethical business relationships. Thus, in these programs to 
the extent that ethics are treated it is done in an effortless and incidental way. 

Two institutions in this group, however, reportedly endeavour consciously to 
integrate ethics into much of what they teach. Illustrative of this category is the 
following comment from one professor in a Canadian institution: 

I would characterize my approach to ethics in administration preparation as requiring: 


I: First — the identification and naming of ethical issues arising from real situations 
reported by the students (not artificial cases or exercises) 

24, Second — the exploration of the alternative values associated with different 
courses of action regarding the identified situations 

a Third — the presentation of my position on an appropriate course of action — 


along with an accompanying rationale. Of course, a discussion and encouragement 
of other viewpoints is mandatory. 
This approach to considering ethics consciously as tangential to other primary foci 
of courses is dependent almost entirely on the interests and capabilities of individual 
professors. 


The third and smallest group of responses came from four universities which 
have components within their administrator preparation programs that are 
deliberately and distinctly focussed on the subject of ethics. There follows a brief 
descriptive summary for three of these, beginning with the most deductively 
oriented and concluding with the most inductively oriented. 


At the University of Minnesota, Sam Popper has developed a graduate seminar, 
which is cross-listed with the Classics Department, entitled “Applied Humanism in 
Administrative Behavior: The Classical Perspective.” He and a professor of Classics 
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have also been testing this kind of seminar as an inservice offering through six 
three-hour sessions with superintendents in the field. The seminar encourages a 
search for linkages between Classical antiquity and contemporary problems 
confronting educational administrators, and a core graduate course in educational 
administration (“The Social Organization of Schools”) is a prerequisite for it. The 
themes selected for the seminar “point to a range of contemporary problems in 
American society which impinge upon administrative decision-making behavior and 
whose antiquity is attested to by their recurrence in Classical literature.” The 
seminar themes include: social contexts of value formation, valuing, and value 
change; the concept of the “polis”; ends-means reversal; concepts of power, influence, 
and authority; role expectations vs. human needs; recurring adaptation of a group to 
external pressures, internal redefinition of function, and transaction; nonverbal 
communication and the authenticity of behaviour — man the manager of 
impressions; role-motivated behaviour in formal organization; bureaucracy in 
antiquity and the “ideal type” of Max Weber; applied technology, moral restraints, 
and enlarging definitions of authority; the social function of myth; the tragic in 
human enterprise; the Saturnalia spirit. With each theme, relevant works from the 
Classics are examined for application to administrative behaviour in education 
today. To facilitate access to the Classical literature, a resource book of excerpts has 
been prepared. 


At the University of Colorado, Russ Meyers consciously attempts, in teaching 
the Introductory course, to identify ethical issues, examine implications for 
administrative behaviour, and ask the individual student to recognize the 
relationship between that behaviour and the ethical issues involved (although he 
does not deliberately try to develop any specific ethical knowledge, skills, or 
attitudes). He draws on a few novels, plays, and films chosen because they portray 
significant moral dilemmas; these are supplemented by a number of one-page 
administrative incidents — summaries of actual happenings with value implications 
collected from a variety of sources — which are used as bases for class presentations 
and analyses. In addition, he has adapted a number of brief paragraphs describing 
situations and actions in administrative practice which clearly involve ethical 
problems; the students are asked to indicate the extent to which they approve or 
disapprove of the actions taken and to explain their responses. This course 
represents a combination of the inductive and deductive approaches to ethics in 
administrative preparation, but it is not clear how effectively it articulates these two 
approaches. 

Finally, the University of Wisconsin at Madison recently introduced two 
annually offered, semester-long courses. While not required, they are assigned to a 
category of courses from which students must make choices. One of the courses, 
Educational Ethics, is taught by two philosophers and offered through the 
Department of Educational Policy Studies; it involves “a philosophical examination 
of selected moral issues facing school personnel.” The course content addresses 
issues related to dealing with young people in schools, personnel practices, the 
distribution of resources, and the contribution of philosophy to the solution of 
practical problems. Readings are drawn from works on educational institutions and 
philosophical ethics. This course must be taken prior to or concurrently with the 
other new one, Ethics in Educational Administration, which involves “the study of 
ethics applied to problems in educational administration, with special emphasis on 
analytical development of ethical rationales through case studies.” The content of 
this course deals with the context of ethical problems and moral dilemmas in 
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educational administration and involves the analysis of case studies in the 
administration of public schools, vocational-technical education, and _ higher 
education. The course derives from the beliefs that ethical competence’ is essential to 
educational administration today and that neither intuitive personal judgments and 
opinions nor science and systems analysis are sufficient for the development of this 
competence. So the instructors seek consciously to prepare their students “for the 
application of analytical skills in ethical interpretations of academic and 
professional problems.” The course draws on the growing body of literature about 
values, morals, and ethics in educational administration, and it relies heavily on the 
analysis of case studies which portray ethical dilemmas in the field. 


In summary, this informal survey suggests that universities display a variety of 
approaches to the treatment of ethics in preparatory programs for educational 
administration — ranging from institutions where the subject is virtually ignored, 
through those in which it is treated (either incidentally or consciously) as a 
nonseparable subject that arises naturally in most of what is taught, to a few 
universities which have deliberately established distinct components on ethics in 
their administrative preparation programs. 

One might identify four alternative rationales underlying this variety of 
responses. First, ethics cannot be taught (it is a matter of one’s experience and 
values and the interaction of the two); this rationale for training relies on accident. 
Secondly, ethical sense can be developed by helping one to observe and analyse the 
behaviours of others and to clarify and understand his own values; this rationale for 
training stops at awareness. Third, ethical behaviour can be enhanced by conscious 
efforts to practise it through clinics, practica, and reality-based cases and 
simulations; this rationale for training proceeds to action, but with emphasis on 
viable and situationally determined alternatives. Finally, there are certain value 
systems and ethical principles that are desirable in the kind of society we live in or 
aspire to, and these can and should be inculcated within the moral philosophy which 
guides the behaviour of educational administrators; this rationale for training 
extends into advocacy. Responses to the author’s survey imply that the order in 
which these four rationales are presented above corresponds to their frequency of 
application; that is, the rationale relying on accident for training in ethics is the 
most common, and that involving advocacy of particular ethical principles is unused 
(or at least not admitted to). As between the two intermediate rationales, there seem 
to be more programs intended to create an awareness of ethical issues than there are 
programs engaging their students in action with an ethical focus. 


The Prospects 


What further development,if any, should be pursued in this area? As a prelude to 
addressing this question, a summary of the apparent situation at present is in order. 
A review of relevant literature and practice suggests that, especially during the last 
fifteen years, preparation experiences in educational administration have featured 
considerable discussion of values at the conceptual level, exploration of various value 
systems and their implications for administrators, selected study of moral 
philosophy, some efforts at value-clarification exercises (at times associated with 
organization development or sensitivity training sessions), occasional exposure to 
content and products in the humanities as stimuli for examining moral dilemmas, 
and passing reference to the importance of “oughts” and subjective judgment in such 
key administrative tasks as decision making, problem solving, and purpose setting. 
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But there has been little serious, conscious effort to develop demonstrably in 
students the skills or behavioural propensities to act in ways that could be 
considered ethical; nor has the subject of ethics been clearly, consistently, or 
operationally defined in relation to such concepts as philosophy, values, sensitivity, 
authenticity, integrity, or morality. 

The various methods that have been assayed as vehicles for training in ethics 
range from the largely affective to the mainly cognitive in nature. One affective type 
of approach comprises the various human relations techniques. The importance of 
sensitivity, empathy, and communication skills to administration cannot be denied. 
However, these skills can be practised, and taught, without an ethical component 
and they may, in fact, have anti-ethical valences. In addition, they can prove 
threatening to some; and they largely lack the cognitive component that some 
believe is necessary to ethical development. Roald Campbell (1972) has said that 
“the group process people, particularly the extreme proponents, appear to have 
taken us in. The inner feelings of many people might just as well stay inner; most 
likely nothing is to be gained by exposing them” (p.13). He and others recognize the 
importance of affective elements in preparation for educational administrators, but 
they reject them as the primary preoccupation in ethical development programs. 


For similar reasons, the typical values clarification techniques appear 
unsatisfactory. While such approaches may play a useful supporting role in the 
development of ethical competence, they do not seem sufficiently cognitive in nature 
to bear the main burden. As usually practised, these techniques tend to be too 
game-like and simplistic for this purpose. Moreover, they often deal with values that 
are not essentially ethical in nature. 


A third questionable approach is the one apparently relied on most heavily by 
universities at present. It might be referred to as the “laissez-faire” approach, for it 
depends on models and experiences for ethical development, and involves no direct 
instructional effort. It is based almost entirely upon chance — upon the ethical 
qualities of models to which one may have had exposure in life, and upon the 
willingness and ability of one to analyze these matters from an ethical point of view 
in an intellectually rigorous way. Nevertheless, it is a phenomenon whose operation 
should be recognized and exploited in ways supportive of any primary approach that 
is adopted, for it does seem futile to try and teach ethics with credibility if one is 
perceived by one’s students to behave in unethical ways oneself. 

Another relatively common approach is the integration, whether intentional or 
incidental, of moral issues throughout the regular program. While it has the 
advantage of suggesting that ethical questions are not isolated problems but rather 
are an integral part of the daily life and experience of the profession, this approach 
suffers from two main weaknesses, as noted by Derek Bok (1976). First, it may not 
provide students with opportunities to reason carefully about moral issues: “Most 
professors have so much ground to cover that they will rarely take the time to 
acquaint their students wih the writings of moral philosophers on the ethical issues 
under discussion” (p.27). Secondly, Bok quotes a business school report which 
suggests that spreading responsibility for moral education among a large number of 
faculty members is disadvantageous because many of them lack the knowledge of 
ethics necessary to the task and, accordingly, tend to “give short shrift to the moral 
problems and concentrate on other aspects of the course materials that they feel 
more equipped to teach” (p. 27). A more distinct focus on ethics seems called for. 
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A fifth approach is one that several educational administration scholars urged 
strongly during the 1960’s (see, for example, Farquhar, 1970). It involves having 
recourse to the humanities, including Classics, as a foundation for the study of 
problems in values and moral dilemmas. Most of its erstwhile proponents would 
likely agree now that this movement never really “took off.” There are still several 
professors who are deeply committed to it and who are implementing it, but they 
comprise a rare breed; moreover, the extent to which their efforts are directly 
contributing to the ethical development of their students remains uncertain. There 
seem to be two main problems with this approach. One concerns instructional staff: 
most professors of educational administration lack the necessary familiarity with the 
humanities to do the job well; their relations with professors in the humanities are so 
distant, and perhaps tinged with awe, that importing the talent from there is not a 
realistic option at many institutions; and sending one‘s students across campus to 
take courses in the humanities departments risks the problem implied in this 
question by Edwin Bailey (1977): 

Is the knowledge which our students may gain in a “liberating” course worth the 

negative impact on self-image? Or stated conversely, are the illiberal attitudes of a 

particularly specialized and critical-of-non-majors professor in the liberal arts worth the 

“liberating” knowledge provided? (pp. 33-34). 

Secondly, a study reported in 1970 (Farquhar) indicated that a problem common to 
the humanities-based approach is the difficulty of achieving articulation between 
the humanistic content studied and the actual problems administrators confront in 
practice. The subject of ethics is a practical and immediate one. In approaching its 
study deductively from a base in the humanities, one starts at a point which, while 
perhaps exotic, is not sufficiently erotic to “grab” the attention of most educational 
administration “types.” The contemporary surge of utilitarian vocationalism is 
contributing to this problem. A more inductive approach seems called for, one that 
starts with a primary focus on real, contemporary ethical problems confronting 
educational administrators and then delves selectively into humanistic content from 
time to time as circumstances dictate. 


Finally, there is the more specific emphasis on the study of moral philosophy. 
Because this approach selects from the almost infinite humanities a particular 
component that deals specifically with thought about morality, the problem of 
professorial inadequacy is somewhat reduced in seriousness. But the problem of 
applicability remains. At the University of Rochester seminar on Ethics and the 
School Administrator a decade ago, the moral philosopher Lewis White Beck (1970) 
put it this way: 

the philosophical interest in ethics is not the hortatory or persuasive interest in trying to 
modify the way human beings behave. Moral philosophy is an intellectual venture, and 
if it attempts to modify human beings at all, it does so by trying to modify ways of 
thinking about moral questions rather than by trying to tell what the right answers to 
the questions are. (p.43)? 
It was Immegart’s view that, in pursuing the several objectives of that Rochester 
program, the least success was realized in applying ethics to the resolution of specific 
administrative problems (Farquhar, 1970, p. 33). To be meaningful, as Paul Kurtz 
(1969) says, “a moral principle must be applicable to concrete contexts and be 
workable in detailed cases” (p. 201). Yet, most courses in moral philosophy are 
simply surveys of ethical theory: they “do little to help the student cope with the 
practical moral dilemmas he may encounter in his own life” (Bok, 1976, p. 27). The 
moral philosophy approach may be more appropriate for the relatively relaxed and 
mature setting of mid-career in-service study,’ but for the less seasoned students in 
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educational administration programs it will probably remain unsatisfactory, at least 
pending success in developing a philosophy of administration which “begins in the 
dirt,” following the lead of Christopher Hodgkinson (1978, p. 170). (The thoughtful 
examination of the propositions set forth at the end of Hodgkinson’s recent book 
could, in itself, provide an excellent basis for the development of moral philosophic 
thought and ethical behaviour on the part of administrative trainees (pp. 202-221); 
the same is true of his “analytical model of the value concept” (p.111), with its 
heuristic implications for morality and ethical behaviour in administration.) 


But if all of these techniques seem in some ways unsatisfactory as primary 
vehicles for the development of ethical competence, what approach should be 
adopted? It would appear from the foregoing that a satisfactory vehicle would need 
to be cognitively oriented, inductive and applied in nature, and consciously 
distinguished within the educational administration preparation program. Some 
guidance toward this end may be gained from the moral education movement 
currently burgeoning in the school systems. While there are several different 
approaches to moral education (Purpel & Ryan, 1976), one that seems particularly 
appropriate for present purposes is the cognitive-developmental method developed 
by Kohlberg, because it focusses on the development of moral reasoning, an 
essentially cognitive process. This was the approach chosen by Glen Sinclair in a 
course pertaining to administrative ethics that he designed and tested at the 
University of Alberta a few years ago, and it appears to provide the underlying 
rationale for the program at the University of Wisconsin mentioned earlier. It also 
constitutes the basis for Bok’s (1976) affirmative answer to the question: “Can ethics 
be taught?” Bok views with approval the development in some professional schools 
(notably Law, Medicine, Public Affairs, and Business Administration) of courses in 
applied ethics. They generally proceed by discussion rather than lecturing, and their 
main purposes are “to help students become more alert in discovering the moral 
issues that arise in their own lives,” “to teach students to reason carefully about 
ethical issues,” and “to help students clarify their own moral aspirations” (p.28). 
They are problem-oriented, taking as their starting point a series of contemporary 
moral dilemmas drawn from the context of the profession concerned. 

In regard to methodology, it is well to remember the four main components of 
values education: (1) awareness of one’s own and others’ values; (2) sensitivity to the 
feelings, attitudes, and values of others; (3) moral reasoning about value-laden 
issues; and (4) ethical action which expresses or affirms prosocial behaviours (Meyer, 
1976, p. 215). The first two of these components may be seen as preliminary to the 
“main event,” and they can be facilitated through values clarification activities and 
human relations exercises. The heart of a course in applied ethics should consist of 
work on moral reasoning about value-laden issues; this can be effectively stimulated 
through the use of well chosen cases, and it can be enriched by selective recourse to 
works in moral philosophy and the humanities generally but its primary modus 
operandi is through discussion involving the application of intellectual skills to the 
analysis of professionally relevent ethical questions. It is important, however, not to 
stop short of the fourth component, ethical action; the significance of experience 
(which can be gained during preparation programs through simulations and 
internships) was demonstrated in the results of a comparative study of graduate 
students and practising executives in the field of business, which indicated that 
ethics tend to improve with experience (Fulmer, 1968, p. 39). 
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So one can envision a methodological sequence for courses in administrative 
ethics, beginning with the nonthreatening third-person discussion of cases or 
incidents portraying ethical dilemmas, moving to more ego-involving discussions as 
one’s own ethical competence becomes the focus of attention, and progressing to the 
point of personal action on ethical problems (first through simulations and finally in 
field experiences). This methodological progression in courses to develop ethical 
competence, then, is from awareness through analysis to action. 


The success of such efforts, of course, will be dependent largely upon the 
capabilities of the professors responsible for them. Bok (1976) observes that 
“unfortunately, it is simply a fact that many courses in applied ethics have been 
taught by persons with little qualification beyond a strongly developed social 
conscience” (p. 30). In suggesting what one needs to know in order to teach such a 
course competently, he says: 

To begin with, instructors must have an adequate knowledge of moral philosophy so 
that they can select the most useful readings for their students and bring forth the most 
illuminating theories and arguments that have been devised to deal with recurrent 
ethical dilemmas. In addition, teachers must have an adequate knowledge of the field of 
human affairs to which their course is addressed. Otherwise, they will neither be credible 
to students nor succeed in bringing students to understand all of the practical 
implications and consequences of choosing one course of action over another. Finally, 
instructors must know how to conduct a rigorous class discussion that will elicit a full 
consideration of the issues without degenerating into a windy exchange of student 
opinion. (p. 30) 
In educational administration, this combination of talents may initially require, in 
most universities, a team approach to teaching such a course. However, it would 
probably be preferable to recruit or retrain individual professors of educational 
administration who can take full responsibility for the instruction themselves. 


In order to design good courses in administrative ethics, considerable time and 
effort must be devoted to the determination of appropriate content and experience. 
Guidance for this may take such forms as “the educational administrator’s ethical 
screen,” developed by Immegart and Burroughs (1970, p. 95). This model is 
conceptually consistent with the ”continuum of interest” which Hodgkinson (1978) 
recognizes as starting from the ego and extending outward to ever larger and wider 
ranges of interest: 

extended ego in the family, kin-group, and clan; the work-group; the organization of 
major employment; the community of work and of leisure, the sub-culture; and finally to 
society, nation, and culture at large. At each discernible quantum shift along this 
continuum, conflicts of interest can occur. (p. 192) 
Frameworks such as these can be very helpful not only in thinking about ethics, but 
also as guides for the sélection or development of cases for use in courses on 
administrative ethics — a task toward which UCEA might well direct its attention. 
Similarly, as Miklos (1978, p. 4) has observed, good research is needed on such 
matters as the nature of ethical problems confronted by educational administrators, 
the ways in which they typically cope with such problems, and the characteristics of 
educational organizations insofar as they affect the ethical behaviour of 
administrators. (In this regard, it is encouraging to note the recent establishment in 
London, Ontario, of Canada’s first Centre for Research in Ethics, although its focus 
is mainly on biomedics, big business, law, and journalism.) 


Some serious homework is required if the development of ethical competence is 
to be fruitfully pursued in preparation programs for educational administrators. 
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There are, nevertheless, some promising beginnings, particularly in courses like the 
one on adminstrative ethics at the University of Wisconsin outlined earlier. 


Conclusion 


In conclusion, the main points made in attempting to answer the questions posed 
in the beginning of this article can be summarized as follows: (1) ethics can and 
should be included in the formal preparation of educational administrators; (2) this 
is apparently not done widely or well at present; (3) it should be done consciously 
and as a distinct component of training programs; (4) ethical competence is basically 
an intellectual capacity, arrived at through cognitive development, and its nurture 
should be approached inductively in an educational administration problem-based 
way, using humanistic literature as supplementary rather than foundational; (5) it 
should be taught by educational administration professors with the necessary 
expertise, which may require some new forms of training or retraining; and (6) 
considerable research and the development of conceptual models are needed as 
bases for program design efforts. 


A focus on ethics may present the “handle” to some exciting, important, and 
relevant training experiences that emphases on the humanities and values have 
somehow failed to provide. There is something vital, current, and personally 
pertinent about ethics that may be lacking in the humanities generally, as far as 
most administrators-in-training are concerned. Perhaps the day may come when 
entire preparation programs in educational administration are focussed on the 
development of ethical competence, with the selection of social science and 
humanistic content and of instructional materials and field experiences being 
determined by the nature of a few crucial ethical problems around which the 
programs are built. 


Notes 
I. The present use of this term is similar to that in Miklos’ (1978) call for us to help administrators “become more 
competent in dealing with the ethical aspects of their actions ” (p. 4). 
ve, It should be noted, however, that in a subsequent chapter, on “Towards the Ethics of Educational Leadership,” the 
educational philosopher Theodore Brameld expresses a contrary opinion (p. 72). 
3. This is the context in which it is used with military officers taking a year of graduate studies at the Naval War College 


in Newport, Rhode Island (This prof learned the hard way, 1979). 
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BOOK REVIEWS 


THE ABCs OF LITERACY. By Stephen N. Judy. Toronto: Oxford University 
Press, 1980, 361 pp., $21.25. 


The ABCs of Literacy is a statement of a contemporary reformer’s perception of 
education. Judy, the author, believes that the time is ripe for real reform in 
education, but he does not explain why. My view is that universal public education, 
being just over a century old, has hardly been considered seriously by first-rate 
philosophers and thinkers. There are exceptions and Dewey’s writing comes to mind. 
Anyone familiar with the works of Dewey will realize the pervasive “progressive” 
influence throughout Judy’s work. This suggests the concept propounded by 
William James that ideas go through a process of gestation. They are criticized, 
ridiculed and forgotten, then they re-emerge and are considered more seriously. We 
are entering the fourth generation of Dewey’s ideas and it is clear that they now 
form the woof of the fabric of contemporary educational thought. 


Before being introduced to current theories the reader is introduced to some of 
the main criticisms of the schools and to a consideration of some of the main 
responses to those criticisms. I believe Judy is substantially correct in his review of 
the criticisms and in his responses to these criticisms. Those who complain about 
literacy in the schools simply do not pay sufficient attention to the data; their 
consideration of the evidence is usually unscientific and the testers have a vested 
interest in keeping things unclear or ambiguous. It is a damning statistic to show 
that there has been a decline in achievement in verbal and math tests that has 
averaged out at 40 points in recent years. It is inexcusable that the meaning of this 
decline has not been carefully explained and it is immoral not to emphasize that the 
strange grading sytem used by some testing services makes a variation of 60 points 
statistically insignificant. This is what Judy’s analysis suggests. 

The ancient fallacy that today’s generation is not as good as the last is explored. 
It is pointed out that literacy is perceived to be a human characteristic that carries 
with it a burden of unrelated qualities. People who can read or write are somehow 
deemed to be “better” in complex and varied ways than those who cannot read or 
write. Accepting the intellectual superiority of readers and writers is easy and 
obvious but the moral or ethical superiority of readers over non-readers is much 
more conjectural. It would seem that contemporary critics of education merely 
reflect a current version of the Greek view that the world has deteriorated since it 
was ruled by Olympians. Judy produces samples of evidence from the last 200 or so 
years that clearly indicate that there never was a golden age of literacy and today is 
a dark age only in the minds of those who want to see it that way. Strangely, these 
critics consistently make statements that are bereft of evidence and reveal an 
extraordinary ignorance of the problems associated with these kinds of assertions. 
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After highlighting some of the pseudo problems of education, we are introduced 
to the “real issues.” These issues, it is argued, are associated with the growing 
detachment of education from society. Greek education, Judy claims, was always 
integrated with the needs of society and the children of Greek aristocrats learned 
what was functional and relevant. By the 1840’s the wealthy young Americans may 
well have known how to decline the Latin verb “to swim,” but many of them started 
out to become naval officers but were drowned before they were commissioned. The 
Greek aristocrat learned to exist under the conditions of his day as well as learning 
rhetoric, poetry and mathematics. 


Judy reveals a main issue of contemporary education when he cites this kind of 
detachment from reality. He presses the point further when he indicates the kind of 
“decaling” that goes on in classrooms. That is, the tendency to analyze writing to 
reveal topic sentences, unity, coherence, introduction, conclusion, body. . . the 
parsing of sentences into verbs, nouns, gerundives, etc. It is little wonder that 
children do not read and write as well as they should. They simply do not spend 
sufficient time on these “wholesome” pursuits. Reading and writing about 
significant issues will do more to improve reading and writing than all the 
grammatical exercises in all the workbooks used in Canada. 


The first part of Judy’s book is an effort to establish a set of arguments which 
justifies the introduction of an enormous variety of activities where language is 
shown to be the medium of education. There is something awry with this argument. 
For example, Judy writes: 

We must educate children through language to the level of their capabilities, by 
providing a good, general. . . education. 

Many of the exercises Judy cites are taken from the sciences, career guidance, 
sociology, anthropology, and other areas of learning. The point is, to grasp these 
activities properly, teachers would be required to have an understanding of the 
disciplines from which the exercises derive. This is an interesting theoretical issue. 
Should the teacher who is competent in language and educated in a discipline (or 
a couple of disciplines) be the basic teacher or should teachers be experts in 
language and then make sorties into a variety of disciplines. 

My own view is that the former approach is reasonable because those trained in 
language tend to concentrate on language and “literature” rather than on the 
dynamics of a particular discipline. Most students who wish to become teachers have 
had 12 years of language in schools — what they need is to develop competence in 
particular areas and not necessarily more competence in language per se. Indeed one 
of the main problems with contemporary education is the literary orientation that 
dominates the schools. Another is the tendency to “academicize” everything. 
Children need to get out into the world and to make observations, predictions, and 
assessments as if they were were scientists, historians, sociologists, and so on. This 
requires more than a thorough knowledge of language and it is also an issue that 
Dewey explored. 


This is a useful volume which presents a case clearly, forcefully and well. It is a 
useful volume because it reveals the mindlessness of the typical conservative 
argument. It is also an excellent volume for the classroom teachers who desire to 
think about the future, as it contains some interesting ideas about planning and is a 
tremendous resource for activities. There appears to be an inconsistency between 
the arguments for a more relevant approach to education and the emphasis on 
language, but this is a technical and theoretical argument that does not detract from 
the usefulness of this volume for public and high school teachers. 


Donald A. Maclver 
Dean of Education 
University of New Brunswick 
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THE CANADIAN CURRICULUM: A PERSONAL VIEW. By Robin Barrow. 


London, Ontario: The University of Western Ontario, 1979, 118 pp. ISBN 
0-920354-04-1. 


Robin Barrow, Lecturer in Philosophy of Education at the University of 
Leicester, spent the academic year 1977-78 as a Visiting Professor in the Faculty of 
Education, University of Western Ontario and, on the basis of this experience, wrote 
an essay, the main thesis of which is that Canada ought to adopt a compulsory core 
curriculum with Barrow’s guidelines as to content. Barrow arrives at this thesis 
rather wanderingly via a discussion of “progressive” and “radical” critics of 
educational practice. Each gets a chapter and, likewise for loosely collected 
discussions about “Classroom Practice” in which he includes justifications for 
English and history on the curriculum, some comments on the unlikelihood of 
Canadiana inspiring a sense of Canadian identity, values education, evaluation and 
models of teaching, sundry other topics, and “The Faculty of Education Curriculum” 
which deals mainly with what ought to be taught to student teachers. 


What Barrow has to suggest about the curriculum is that there be a primary 
stage (Grades 1-7) concerned with developing literacy, numeracy, computer 
competence, physical health and pursuit of (educational) topics that interest the 


children, e.g. handicrafts. There should be, next, a transition year (Grade 8) during 
which the natural sciences, mathematics, and fine arts are studied. There should 
then follow a secondary stage (Grades 9-12) devoted to English, history, social study, 
and French. 

What is controversial about this is of course the emphasis to be placed on the 
humanities at the expense of the natural sciences: twice the amount of time as is 
presently spent in Ontario on history and English and one-half the amount on 
mathematics. Surely Barrow is right to want a redress in the present over-balance of 
sciences to humanities; but his proposal would swap one imbalance for another, and 
in consequence leave students largely ignorant of the very underpinnings of our 
technological society. The trade off in favour of developing the skills of 
communications, which Barrow takes to be justification for his preponderantly 
humanistic curriculum, is simply too expensive. 

Barrow may be correct, turning to his main thesis, that we ought to have a 
compulsory core curriculum; but this is not convincingly shown by any of his 
arguments. His “argument from knowledge” shows that there are some propositions 
that are supracultural (e.g. mathematical propositions) but not why they ought to be 
taught. The “argument from culture (small ‘c’)” is intended to show that national 
unity depends upon having (some) shared culture; hence the need for everyone to 
study history, etc. He is surely right in some matters: we must all agree to obey 
common laws, for example. But Barrow seems to have missed the point about the 
desire in Canada to preserve cultural diversity. National unity is not the same as 
cultural homogeneity. The “argument from Culture (capital ‘C’)” fails because, 
although it succeeds in the claim that various fields, music for example, have 
reasonably objective criteria for determining quality, it does not show why on the 
curriculum there should appear, for example, the best classical music but not the 
best rock music. The “argument from choice” nicely shows why students ought to 
study complex subjects, namely because they are not easily picked up on their own, 
but it does not provide any reason for insisting that students should be compelled to 
study one particular complex subject rather than another. 

The essay is disappointing in other areas as well. For instance, for whom is 
Barrow writing this? If curriculum theorists, philosophers of education, etc., then he 
is enormously patronizing in tone and topic. If students of education, then he 
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touches on far too many topics without drawing them together well or dealing in 
sufficient depth with any. A related sin is that many of his excursions are 
unnecessary to sustaining his overall thesis — this is especially true of the two 
chapters devoted to (inexplicably selective) discussions of radicals and progressives. 
It is solid philosophical discussion of the curriculum that we need regardless of who 
writes it. Barrow, however, is superficial and impressionistic. Perhaps this is so 
because Barrow’s Canada is Ontario. He admits to knowing little about the rest of 
Canada, yet fails to see this as a problem when making national curriculum 
proposals. 

In sum, what could have been a useful contribution to the important matter of a 
core curriculum is impressionistic and weak. 


A. S. Carson 
Department of Educational Foundations 
The University of Alberta 


PIAGET’S THEORY: A PSYCHOLOGICAL CRITIQUE. By G. Brown and C. 
Desforges. London: Routledge & Kegan Paul, 1979, 196 pp. 


In Piaget’s Theory: A Psychological Critique, the authors have combined an 
overview of Piaget’s theory with an in-depth empiricist critique. If the book is 
judged in terms of how it meets both these ambitious goals, it will be seen as lacking 
in several important respects. However, if it is more modestly viewed as an attempt 
to assemble the growing accumulation of empirical data which now poses serious 
problems for orthodox Piagetians, then the book indeed fulfills a useful function. 


. For the student new to Piaget, the book can not replace the more accessible 
introductions to his theory such as are offered by Ginsburg and Opper (1969) or 
Flavell (1963). The overview of the theory which Brown and Desforges provide 
focuses primarily on the concept of stages and their invariance, the 
logico-mathematical structure of children’s thought, and the equilibrium process. 
Few examples are included and only a cursory definition of many of Piaget’s terms is 
provided. 


Although the authors of this book are English, the description and critique of the 
theory is definitely biased towards the North American perspective. The authors’ 
aim is to demonstrate that Piaget provides a competence model of development, 
rather than a performance model. The claim is that this fact insulates the theory 
somewhat from direct empirical examination and makes it more difficult to assess. 
Nonetheless, the theory stimulated considerable empirical research, and has been 
the subject of numerous critiques. (cf. Flavell, 1963; Brainerd, 1978). 


The major thesis of the book is that Piaget’s theory, when subjected to strict 
critical examination in light of the assembled empirical date, fails on three accounts: 
(1) much of the theory is untestable; (2) that which is testable in the theory has 
often been shown to be wrong; and (3) the theory has become overburdened by post 
hoc constructs such as “decalage” and “task resistance,” which serve to salvage the 
theory at the expense of parsimony and predictability. In addition, the authors claim 
that Piaget ultimately provides no explanation of development, that is, how 
structures evolve from stage to stage. 


208 


In their critique, Brown and Desforges review many of the well known criticisms 
which have been voiced by North American researchers such as Brainerd, Gelman, 
Trabasso and Bruner. These and other researchers have shown that given special 
conditions, the onset of particular logical competencies can be moved up or down the 
developmental scale by manipulating task components or the experimental setting. 
Likewise, stage-like behaviour can be made to appear or disappear by manipulating 
assessment criteria; logical behaviour can be induced via certain training procedures, 
and even very young children can be shown not to be egocentric, given the right set 
of conditions. For example, Piaget has claimed that preschoolers can not conserve 
number or quantity. Yet Gelman (1972) has shown that three-year-olds can conserve 
number if the set size is limited to under four items. One must inquire whether such 
limited demonstrations of logical competencies appearing much earlier than Piaget 
has claimed (and out of step with other concrete operational abilities) in any sense 
invalidate his theory as Brown and Desforges seem to imply. In my view, they do 
not. In the above example, young children could only conserve sets which they could 
subitize, i.e., sets which they could quantify in a single fixation without the 
purposeful allocation of attention. Thus, is this ability to recognize changes in 
quantity, when only two or three items are involved, an example of a general logical 
competency? Or is it, on the other hand, a cognitive precursor to conservation which 
is built into the human information processing system? 


The authors are undoubtedly correct in pointing out the deficiencies in certain of 
Piaget’s explanations and constructs. Still, their critique often seems to lack focus. 
Their criticisms tend to range back and forth from the almost trivial to the 
profound, without them always being clear on what is at issue. The source of this 
inconsistency lies, ultimately, in the nature of this book itself. Critiques which 
merely serve to point out problematic or anomalous data from the experimental 
literature are generally less effective in identifying or clarifying key theoretical 
issues than critiques which are embedded in the presentation of a rival theoretical 
position or paradigm. 

In the final chapter of the book, Brown and Desforges do discuss several recent 
attempts to develop more performance oriented theories. These theories, they imply, 
can in some sense be viewed as rivals to Piaget’s theory of development. The 
theoretical positions which the authors present are Pascual-Leone’s theory of 
constructive operators and information processing theory. These positions are 
unfortunately described only in barest detail. A more in-depth account, describing 
how these theories handle both the data handled so well by Piaget’s theory and the 
data which have become so problematic for it, would have been useful to help the 
reader to place the foregoing criticisms in clearer perspective. 

This book is one more indication that Piagetian theory is currently undergoing a 
degenerative phase in its development. The theory has become increasingly 
burdened by anomalous data, post hoc explanations, and constructs for which 
empirical support is not forthcoming. In addition, factors which the theory does not 
address at all, (e.g., the growth of processing capacity) have recently been shown to 
have a significant effect on the course of development. It remains to be seen whether 
Piaget’s theory will recover from these challenges and enter a new progressive phase, 
or be superseded by one of the newer functional approaches to development. For 
now, this book stands as a good account of the present weaknesses in Piaget’s 
theory, as it is perceived from the strict North American perspective. 
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GRAHAM HURLBURT 


The University of Manitoba 


Psychoticism, Teacher Appraisal, and Life Styles 
of a Canadian High School Sample 


Data were gathered from a medium sized high school in the middle class suburb 
of a western Canadian city. Using the Eysenck Personality Questionnaire, 
teacher appraisal, and life style questionnaire, the study found significant 
differences (p<.05) between high and low psychoticism scorers on teacher 
appraisal, alcohol consumption, cigarette smoking, marijuana smoking, and sex 
of student; no significant differences (p<.05) were found between high and low 
psychoticism scorers on TV viewing and residence with one or both parents. 


The bane of countless generations of school teachers has been the 
uncooperative, restless, disruptive student. Tales of woe by weary pedagogues 
relate to classroom lessons being subverted by the antics of an antisocial few. The 
emotionally disturbed and the hyperactive student can conspire to interrupt the 
educational process in the classrooms of high schools. 


Is classroom and out-of-school behaviour related to high psychoticism? What 
are the characteristics of the high psychoticism scorer? Eysenck and Eysenck 
(1975) studied 540 males and 590 females between 16 and 19 years of age using 
the Eysenck Personality Questionnaire. They found that high psychoticism scorers 
were solitary, not caring for people and were often cruel and inhumane, lacking in 
feeling and empathy, and altogether insensitive. They were hostile to others, even 
to family and peers, and aggressive, even to loved ones. They had a liking for odd 
and unusual things, and a disregard for danger; they liked to make fools of other 
people and to upset them. 


Eysenck and Eysenck concluded that high school aged high psythoticism 
scorers made up for lack of feelings by indulging in sensation-seeking “arousal 
jags” without thinking of the dangers involved. Socialization was a concept 
relatively alien to them, as were empathy, guilt, and sensitivity to others. This 
description, of course, referred in its entirety only to extreme examples; persons 
scoring relatively high, but nearer the middle range of scores, were represented 


Graham Hurlburt completed a PhD in Counselling from the University of North Dakota in December 1980, and is currently 
employed as Coordinator for Learning Assistance and Counselling for Native Students at the University of Manitoba 
Counselling Centre. His research interests focus on alcoholism and education. 
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more frequently than extremes, and would only show these behaviour patterns to 
a much less highly developed degree. 

Certain previous studies on high school populations using the Eysenck 
Personality Questionnaire indicate psychoticism is related to antisocial behaviour. 
Saklofske (1977) studied four groups of 20 adolescent boys on the basis of 
self-report ratings of antisocial behaviour, teacher’s ratings of classroom 
misbehaviour, and the administration of the EPQ. Significant differences on 
Eysenck’s psychoticism scale were obtained between both high versus low 
antisocial behaviour groups and well behaved versus badly behaved teacher-rated 
groups. Extraversion and psychoticism scales discriminated between students 
scoring high and low on an objective measure of school naughtiness. Subjects rated 
by teachers as badly behaved obtained significantly higher psychoticism scores 
(x=7.78) than their well behaved peers (x=6.08). Also, boys high in antisocial 
behaviour scored significantly higher on the psychoticism scale (x=9.33) than did 
boys low in antisocial behaviour (x=4.53). Eysenck’s theory, that degree of 
psychoticism is capable of predicting misbehaviour engaged in by adolescent boys, 
was supported. 

That there are national differences in psychoticism levels of students is 
indicated by some studies. Eysenck, Adelaja and Eysenck (1977) studied 329 male 
and 101 female Nigerians, and 404 male and 544 female British students to 
compare the two groups. Using the EPQ, they found that the Nigerians had a 
much higher psychoticism score, females had a much lower psychoticism score in 
both British and Nigerian populations, and Nigerian females had a much lower 
psychoticism score than British females. 


Hosseini, Mehyrar and Razavich (1973) studied 768 Iranian male students and 
241 Iranian female students aged 18 to 20 years. Using the EPI, and the Persian 
form of the psychoticism scale they found that Iranians scored higher than the 
British in psychoticism, and Iranian females scored lower on the psychoticism 
scale than Iranian males. Iranians were higher in psychoticism, concluded Hosseini 
et al., because of social breakdown brought about by too rapid urbanization. 


Method 


Subjects 


The sample consisted of 87 high school students to whom the EPQ, a teacher’s 
subjective appraisal, and a life style questionnaire were administered. The sample 
was obtained from the representative daily teaching load of grade ten, eleven, and 
twelve students of a high school English teacher. The subjects, aged 15 to 19 
years, consisted of 39 males and 48 females who attended Dakota Collegiate (900 
students) in a middle class suburb of Winnipeg, Manitoba. With one exception, all 
students were Caucasian and most were born locally. 


Instruments 


To measure the degree of psychoticism in the students, the Eysenck 
Personality Questionnaire was chosen. The EPQ is a 90-item test which measures 
what Eysenck maintains are the three main dimensions of personality: 
Psychoticism, Extraversion-Introversion, and Neuroticism-Stability. All three are 


viewed as higher order factors pertinent to normal as well as pathological 
populations. 
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To construct the new psychoticism scale, Eysenck created a 106-item 
questionnaire to measure different aspects of psychoticism and administered the 
questionnaire to 821 women and 512 men constituting a random sample. Eysenck 
found that there existed items, theoretically relevent to the hypothesis of a 
psychoticism factor, that cohered together factorially in a consistent manner, 
showed relative independence of neuroticism and extraversion, measured 
psychoticism reliably, gave similar results for males and females, and produced 
repeatable patterns. 

The Psychoticism Scale consists of 25 items, 14 of them keyed for a “yes” 
response. The items involve a mixture of risk taking, hostility, and insensitivity, 
and have the overall flavour of psychopathy. 


Tellegen (Buros, 1978) described the high psychoticism scorers as careless, 
impulsive, spurning convention, inconsiderate, unfeeling, and cruel; and also as 
disliked, deceived, and mistreated. They professed marked social alienation and 
perceived themselves as both target for and source of malevolent action. 


The Extraversion Scale consists of 21 items, 18 of them keyed to a “yes” 
response. The items carried over to the Extraversion Scale from the EPI or 
written anew emphasize primarily the sociability component and de-emphasize the 
compulsivity component of the former Extraversion Scales. The Extraversion 
Scale is more homogeneous factorially on the EPQ than the EPI. 


Eysenck describes the extraversion factor as being closely related to the degree 
of excitation and inhibition prevalent in the central nervous system. Eysenck 
describes the typical extrovert as one who is sociable, likes parties, has many 
friends, needs to have people to talk to, and does not like reading or studying by 
himself. He prefers excitement and action to caution and solitude. 

The Neuroticism Scale consists of 23 items, all keyed to a “yes” response. 
Neuroticism is closely related to the inherent degree of liability. The scale 
discriminates between normal adults and neurotics. Eysenck describes the high 
neuroticism scorer as being anxious, worried, moody, depressed, overly emotional, 
overly reactive. The stable individual responds emotionally only slowly and 
generally weakly and returns to base line quickly after emotional arousal; he is 
usually calm, even-tempered, controlled, and unworried. 

The Lie Scale consists of 21 items, 6 of which are scored for a “yes” keyed 
response. The scale basically reflects need for social approval, which is manifested 
in defensiveness, dissimulation, and social naivete. The items are heterogeneous in 
content and roughly balanced in keying. 


Reliability Studies 

The EPQ appears to have satisfactory reliability. 

Tellegen (Buros, 1978) studied a sample of 5574 people who were 
representative of urban England. He found that internal consistencies established 
by alpha coefficients, presented for samples of female and male “normals” and 
prisoners averaged about .85 for the Neuroticism and Extraversion Scales. Kline 
(Buros, 1978) studied the test-retest reliabilities of the Psychoticism, Extraversion, 
Neuroticism, and Lie Scales using 257 subjects (mostly students). He found 
test-retest reliabilities of psychoticism (.78), extraversion (.89), neuroticism (.86), 
and Lie (.84), and concluded that these data indicated the four scales to be 
sufficiently reliable for use with individuals. The reported internal consistency 
reliabilities ranged from .68 to .88, with a median of .84. The stability coefficients 
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determined on normal sample for corresponding value for the Psychoticism Scale 
was somewhat lower, .78. Scale intercorrelations were quite low, indicating that 
the scales have very little redundancy. 

Stricker (Buros, 1978) studied the internal-consistency reliability for normal 
examinees on the EPQ. He found that coefficients ranged from .79 to .85 for the 
Extraversion, Neuroticism, and Lie Scales and was .68 to .74 (males and females) 
for the Psychoticism Score. He found that test-retest reliability, over a one-month 
interval for normal examinees, ranged from .80 to .90 for the Extraversion, 
Neuroticism, and Lie Scales, and was .71 and .83 (male and female) for the 
Psychoticism Scale. 

Loo (1979) studied 139 female and 123 male university students using the EPQ 
to obtain scale statistics and to perform hierarchical analysis. He found the 
internal consistency reliabilities of the Extraversion, Neuroticism, and Lie Scales 
to be in line with those reported in the EPQ Manual. 


Validity Studies 


The EPQ has been successfuly correlated with the major scales of other 
validated instruments. Wakefield, Sasek, Brubaker and Friedman (1976) studied 
samples of 79 and 53 individuals respectively, using the EPQ compared by means 
of canonical analysis to the Meyers-Briggs Type Indicator and to the Adjective 
Check List to determine the validity of the EPQ. They found that the 
Extraversion and Thinking Scales of the Meyers-Briggs Type Indicator and the 
EPQ Neuroticism Scale were significantly negatively correlated. The 
Meyers-Briggs Type Indicator Judgment Scale was significantly correlated with 
the EPQ Lie Scale. The EPQ Psychoticism Scale was positively correlated with 
the Adjective Check List needs for Autonomy and Aggression and negatively 
correlated with the needs for Nurturance and Deference. The EPQ Extraversion 
Scale was positively correlated with needs for Dominance, Heterosexuality, 
Exhibition, Autonomy, and Change; and negatively correlated with the needs for 
Succorance, Abasement, and Deference. EPQ Neuroticism was positively 
correlated with the needs for Succorance and Abasement, and negatively 
correlated with the need for Dominance. 

The canonical analysis of the Adjective Check List and the EPQ yielded two 
significant canonical components. The first (R=.77, X¢?=96.68, p<.01) had a 
large canonical weight on the EPQ Extraversion Scale (.92) and on the Adjective 
Check List Achievement (.55), Dominance (-.49), and Succorance (-.41) Scales. 
The second (R=.70, X4.?=58.78, p<.05) had large canonical weights on the EPQ 
Psychoticism (.71) and Neuroticism (.53) Scales, and on the Adjective Check List 


Nurturance (-.54) and Aggression (-.57) Scales. The results supported the validity 
of the EPQ. 


Friedman, Wakefield, Boblitt and Surman (1976) used the EPQ with 31 
schizophrenics formed into a paranoid schizophrenic group (n=10) and a 
nonparanoid schizophrenic group (n= 21) in order to investigate the validity of the 
Psychoticism Scale. A comparison group of 20 aids were also given the EPQ. They 
found that significant differences between the normals and psychotics was 
consistent with the validity of the Eysenck Psychoticism Scale. That normals 
differed from Psychotics on the Psychoticism Scale but not on the others 
suggested support for the discriminant validity of Eysenck’s Psychoticism and 
Neuroticism constructs, they concluded. 
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Procedure 


Permission from the teacher was obtained to administer the EPQ with an 
accompanying lifestyle questionnaire which asked for information related to 
students’ daily TV viewing, weekly alcohol consumption, whether or not they 
smoked cigarettes and/or marijuana, and whether they lived with one or both 
parents. 


The classroom teacher was asked to evaluate each student in terms of his or 
her classroom behaviour. Students who were attentive, cooperative, and had an 
attention span greater than ten minutes were given a positive evaluation; those 
who tended toward inattentiveness, uncooperativeness, and had an attention span 
of under 10 minutes were given a negative evaluation. The classroom teacher had 
taught English at the school for 12 years, and was considered by both staff and 
students to be competent and well-liked. 


Each student filled in the EPQ with the attached questionnaire at his or her 
desk during classroom time. The teacher related his subjective evaluation of each 
student without seeing the student’s EPQ psychoticism score. The data were 
coordinated and analyzed by the writer. 


Based on the EPQ psychoticism score, the sample was divided into three 
groups: 29 high psychoticism scorers, 29 middle psychoticism scorers, and 29 low 
psychoticism scorers. The high group scored between 5 and 9, the middle group 
scored between 3 and 4, and the low group scored 2 or less on the EPQ 
Psychoticism Scale. In the present study, the high and low psychoticism groups 
were related to teacher evaluation and individual student behaviours. The chi 
squares in Table 1 have one degree of freedom for each categorical variable 
matched with the psychoticism scores. 


TABLE 1 


PSYCHOTICISM RELATED TO TEACHER EVALUATION 
AND HIGH SCHOOL STUDENT BEHAVIOURS 


Categorical Variables Matched v2 p 
with Psychoticism Scores 
Teacher Appraisal (positive or negative) DPR NAT ae On 
TV Viewing (2 hours per day or less) Bi 927 2 0)5 
Alcohol Consumption (5 drinks per week or more) WENGE) s gl05 
Cigarette Smoking (smokes or doesn't) 20.088 << (Oil: 
Marijuana Smoking (smokes or doesn't) 14.224 <n OS 
Parents (lives with one or both parents) 4.036 Sa (O15) 
Sex (boy or girl) 16.667 HOE 


Discussion 


It was found (see Table 1) that there were significant relationships between 
psychoticism scores and teacher appraisal, alcohol consumption, cigarette smoking, 
marijuana smoking, and the sex of the subjects. It would appear that high 
psychoticism scorers received a negative teacher appraisal, drank more alcohol, 
smoked more tobacco and marijuana, and tended to be male rather than female. 
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There were no significant relationships found between psychoticism scores and the 
subjects’ TV viewing habits, or residence with one or both parents. 


Summary 


That there is some relationship between psychoticism, teacher appraisal, and 
many out-of-school behaviours by students seems to be indicated by the present 
study. Further research may be indicated to explore in more detail predictive 
variables and psychoticism in high school students. 
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WILLIAM T. FAGAN 


The University of Alberta 


A Longitudinal Study of Grade Five and Grade 
Eleven Students’ Written Language and a 
Comparison with Teachers’ Written Language 


Written language may be studied from a number of thrusts: context, audience, 
process, linguistic output. This study focussed on the latter and interpreted the 
language used within the Semantic Potential Theory of Language. The main 
tenet of this theory is that language consists of various types of information 
and provides input for meaning. A comparison of the written language of grade 
five students, a sample of these same students in grade eleven, and a group of 
teachers shows that on all measures, grade eleven students differed significantly 
from their grade five performance. Few differences occurred between teachers 
and grade eleven students for total amounts of linguistic information used; 
when differences occurred on linguistic information proportionate to utterances, 
the grade eleven students scored higher. 


Language is a facilitator of thought, and is the basic medium of 
communication. Temporally, for most people, oral language precedes the 
development of written language, and whereas both language systems (oral and 
written) are related, the exact nature and direction of this relationship is not 
known. Wrolstad (1976) advises that each language system should be studied in its 
own right before the relationships across systems are studied. 

Written language has been studied from a number of perspectives. Although 
oral and written language serve a communication function, the nature of the 
communication function in written language differs somewhat from this function 
in oral language. Britton, Burgess, Martin, McLeod and Rosen (1975) have 
identified within written language the function categories: expressive, poetic, and 
transactional. The latter may be divided into informative and connotative writing. 
The concept of audience (self, teacher, or wider public) was also addressed in the 
Britton et al. study. 


Dr. Fagan is a Professor in the Department of Elementary Education at The University of Alberta. He has been a teacher of all 
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A recent trend in research studies in writing has been to study the processes 
that lead to and are involved in the production of writing (Cooper & O’Dell, 1976; 
Emig, 1971; Graves, 1973). However, most research studies in writing have 
focussed. on an analysis of the nature of language aed (Bateman & 
Zidonis, 1966; Hunt, 1965; Mellon, 1969; O’Hare, 1971). 


In order to develop an understanding of written language in toto, it is 
necessary to pursue research along various thrusts. Certainly information on the 
function and audience, the context in which writing takes place, the ongoing 
processes that make writing possible, and the interaction of these factors should 
contribute significantly to an overall understanding. However, the nature of the 
language produced is also a vital component in written language, and knowledge of 
it is important to understand the other factors fully. The nature of written 
language has been largely described within a transformational generative grammar 
framework. This theory of language “is now being increasingly challenged” (King, 
1978, p. 194), particularly because of its focus on the sentence of isolation, and its 
“over-concern” for syntax. The notion of deep structure, at least its amenability to 
research, has also been criticized (Baker, 1976). 


Language is one product of writing and the nature of this language has 
implication for meaning. The Semantic Potential Theory of Language (SPTL) was 
devised to analyse language from this focus (Fagan, 1978). It is used in this study 
as a framework to analyse the written language of a sample of grade five children, 
a sample of grade eleven children (some of whom were from the grade five group 
six years later), and a sample of teachers for comparison purposes. 


Related Literature 


The analysis of written language has been approached from a variety of 
measures: context, function, mode of discourse, processes involved, and form. This 
study is concerned primarily with the form of written language, that is, the nature 
of the language that is actually produced. The measure of language employed for 
this purpose has been usually determined by the theory of language that is being 
popularized at the time. The studies to be reviewed will be grouped according to 
periods when a particular theory prevailed. 


Traditional grammar. One of the earliest studies on the form of language was 
by Stormzand and O’Shea (1924) who analysed ten thousand sentences from many 
sources including children’s and authors’ writing. One thousand of these sentences 
were from the writing of grade four, five, six, seven, and eight students. The key 
measures were sentence length, a ratio indicating degree of subordination, and 
frequency of noun, adjective, and adverb clauses (time, place). Developmental 
trends existed for sentence length and degree of subordination. There were 
inconsistencies with respect to clause type. The measures used in this study (with 
a few additions) became, as Hunt (1965) said, “standard procedure” in written 
language development studies. Studies to follow with similar measures and results 
were: Bear (1939), Biscoe (1951), Frogner (1933), Hoppes (1933), LaBrant (1933), 
Sam (1962), Seegars (1933), Watts (1944), and Wise (1958). 

During the period in which these studies were conducted, the sentence as a 
valid index of written language development was being questioned. Concern was 
first raised by Seegars (1933) who suggested clause measures as a substitute. 
However, the use of the sentence persisted and was still being questioned in 1962, 
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when Sam (1962) found he could not rely on punctuation to determine sentence 
boundaries and appealed to the criterion of a sentence being a “complete 
thought” — which was as much philosophical as it was pragmatic. 

Structural grammar. During this period, the key unit of analysis became the 
various types of sentence patterns (Noun-Verb-Noun, Noun-Be-Adv, etc.) which 
were specified by Strickland (1962). Studies during this period identified patterns 
which did or did not increase developmentally (Sam & Stine, 1965; Zenman, 1969). 


Hunt’s (1965) is a landmark study of written language development. Samples 
of writing were obtained from grades four, eight, and twelve students, and from 
Harpers and Atlantic magazines to compare the students’ performance to that of 
mature writers. 


Hunt’s study is really a transitional study in the sense that it spanned two 
periods dominated by different linguistic theories (structural and transformational 
generative). However, one thing which made the study unique was that its key 
unit of analysis was from neither of these theories. Hunt first employed the t-unit 
(main clause and attached subordinate clauses), which showed a marked increase 
in length across grade levels, and which has had a profound effect on subsequent 
studies. Hunt was not only interested in t-unit length but in those linguistic 
factors that contributed to its length. He identified a number of such factors — 
some of which could be roughly equated with transformations from 
transformational generative grammar. Studies by Bortz (1969) and Perron (1977) 
also used the t-unit but showed that its length increase was influenced by the 
mode of discourse analysed. 


Transformational Generative Grammar. One of the basic tenets of this theory 
was that each sentence originates as a kernel (a simple form) and develops 


complexity by the action of various transformational structures. The number of 
steps necessary to finalize each transformation per sentence were supposed 
indicators of language development. Studies during this period usually attempted 
to identify which transformations occurred at different age or grade levels. Such 
studies were conducted by Braun and Klossen (1973), Gebhard (1978), Maimon 
and Nodine (1978), and O’Donnell, Griffin and Norris (1967). 

Although some of the studies conducted during this period continued to use 
the t-unit, the basic unit of the theory was the sentence — with the inherent 
problems of identification. Furthermore this analysis was entirely syntactic. 
Nevertheless from these studies arose the movement of “sentence-combining” as 
the way to bring about increased maturity or development in written language 
(Combs, 1975, 1976; Fischer, 1973; Mellon, 1969; Miller & Ney, 1968; Morenberg, 
Daiker & Kerek, 1978; O’Hare, 1971; Pedersen, 1977; Perron, 1974; Stewart, 1978). 


Semantic Potential Theory of Language. This theory was devised in response 
to some of the criticisms of transformational generative grammar, particularly the 
difficulty in researching deep structure, and the narrowness of focussing on the 
sentence in isolation from the total discourse. A study utilizing this theory in 
describing written language development was conducted by Cameron (1979). An 
analysis of the written language of nine-, ten-, and eleven-year-old children 
showed changes within each of the categories of information identified in this 
theory. The present study employs this theory to analyse the written language of 
grade five children, a sample of these children in grade eleven, and writing from a 
group of teachers. 
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Design 

In 1973, twenty grade five students and twenty upper elementary teachers 
participated in a written language study. In groups of five, they were shown 
materials on “floods.” The materials consisted of a film loop, a filmstrip, pictures, 
and newspaper clippings. The film loop and film strip were shown first. Subjects 
were allowed to remain as long as they wished to study the pictures and clippings. 


Subjects were then taken individually to a small room where they wrote an 
account of what they had seen. Subjects had been told to pretend that this was a 
class lecture on social studies and one of their classmates (or for the teachers, one 
of their pupils) had recently moved to a different province. They were asked to 
tell that individual in a letter to keep “in touch” all they could about floods which 
were studied in today’s class. 


Because directions were not followed in some cases, and because some had not 
finished within the time available, protocols for analysis were available for 
fourteen of the twenty teachers. 


Six years later (1979) a search was made for the subjects within the grade five 
group; eight were located. The others had transferred outside the geographic area 
and were not available. A sample of eight other grade eleven students was chosen 
randomly from one school and since an analysis of data did not produce any 
significant differences on variables between these groups, the sixteen subjects were 
treated as a single group (grade eleven) for purposes of data analysis. 


Data Analysis 


The analysis of the written language protocols was based on the Semantic 
Potential Theory of Language devised for an earlier study by the author (Fagan, 
1978). 


The basic tenet of this theory is that language is a system for providing 
information (linguistic) which is potential for meaning when interaction between 
the listener/reader and language occurs. In order to understand meaning, it is 
necessary to understand the linguistic data which provides input for the activation 
of perceptual and cognitive processes. Since all language output is input for the 
listener, it is necessary to account for each linguistic unit. 


The two units for initial division of the protocols were the utterance and the 
maze. The maze comprises (a) hesitation pauses (repeat and interjection) and (b) 
correction/edits. The utterance is either a t-unit (main clause and attached 
subordinate clauses) or an incomplete t-unit (incomplete in the sense that it lacks 
a subject, verb, required object, etc., and is therefore not structurally complete). 
An example is “The boy was running for the ship, from a policeman, anxious to 
reach the dock.” Utterances are then analysed for data which comprise four 
categories of linguistic information: denotational, relational, syntactic, and textual. 


Denotational information includes such information as nouns, verbs, 
prepositions, intensifiers, and negatives. Nouns may be related as subjects, objects, 
or complements, and these, with verbs, make up the Relational category. The 
sequential arrangement of the denotational and relational information constituted 
the category of Syntactic structures. Two basic structures are identified: 


(D) N V (Adj)(Adv)*(PP)* 


(D) N V (Adv)*(N)(N)(PP)* 
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* (Adv) may occur at any position within the sequence. 
(PP) may occur at any point within the sequence except 
immediately after the first N. 


Any other segments that could be written as basic structures were termed 
alternate syntactic structures, that is, an alternate form to the basic structure. 
These included such structures as “prenominal adjective” (The black book . . .), 
“clauses” (The boy who is in the boat . . .), “ing-nominalization” (While running 
through the fields, he. ..). A total of 24 such structures were identified. 


These structures were grouped into three sub-categories: relativization 
(intact), relativization (ellipsis), and complementation. This division was mainly in 
terms of their structural form but it does have implication for function. Examples 
are: 

Relativization (Intact) The boy who was ill will not attend 
school today. (adjective clause) 


Relativization (Ellipsis) They saw the boy jumping into the 
boat. (WH + aux/verb) 


Complementation They realized that they could not sail 
today. (that + S object) 


The fourth category, Textual Information, included referential connectives 
(pronouns, synonyms, class inclusion, etc.) and logical connectives (conjunction, 
causality, temporal disjunction, etc.). These two categories were also subgrouped 
(based on a study by Forster, 1978). Items like pronoun, synonym, and repetition 
made up Category I of the Referential connectives, while class inclusion, formal 
repetition, etc. made up Category II. Category III was the first subcategory of 
Logical Connectives and includes conjunction and temporal disjunction (and 
then). All other logical connectives (causality, condition, temporal conjunction, 
etc.) made up Category IV. 


Based on the findings of the previous study (Fagan, 1978), in order to provide 
a single design for this study and a companion study on oral language, and in an 
attempt to limit the analysis to selected parts of the theory, protocols were 
analysed for Denotational information (intensifiers and negatives), Syntactic 
information, and Textual information (referential and logical connectives). 


Hypotheses 


1. A general hypothesis was that there would be differences among the three 
groups although the direction of the differences was not definite. 

2. Findings from the 1978 study showed that the use of negatives and 
intensifiers increased over age levels. The author was curious as to whether 
this trend also occurred in written language. 

3. In the 1978 study, differences occurred between groups on seven of the 
twenty-four syntactic structures. The older children tended to use shorter 
type units and therefore used more alternate syntactic forms per utterance. A 
similar phenomenon has been regarded as a measure of syntactic maturity 
(Hunt, 1965). It was hypothesized that the older groups would use more 
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complex utterances (alternate syntactic forms per utterance) than younger 
groups. The categorization of alternate syntactic structures into subcategories 
was first tested in this study and consequently no directional hypotheses are 
given. 


4. Whereas previous studies have shown either some differences across age 
groups for Referential and Logical Connectives (Fagan, 1978) or that children 
at a particular grade level react differently to the subcategories (Forster, 
1978), the data are not definitive enough to permit directional hypotheses. 


Findings 
The analysis of the data for the three groups, teachers (N = 14), grade five (N 
= 20), and grade eleven (N = 16), are reported as means and standard 


deviations. For some variables, proportions to utterances are given. 

Utterances. When teachers and grade eleven students write, they produce 
significantly more utterances than do grade five students (see Table 1). Very few 
of these utterances are incomplete t-units. What few incomplete t-units are 
produced do not differentiate the three groups. 


TABLE 1 


MEANS AND STANDARD DEVIATIONS FOR UTTERANCES, 
T-UNITS, AND INCOMPLETE T-UNITS 


Means Sole, F-ratio 

Utterances 

Teachers Zor 21 2 Vh 

Grade 5 £455 6.07 

Grade 11 28.87 TM 2 10.50%* 
T-units 

Teachers Dees 2a Sal 

Grade 5 WIL tHe, 4.98 

Grade 11 DTP SV ile ley 14.68% 
Incomplete t-units 

Teachers 2.00 4.20 

Grade 5 Ladd 304 

Grade 11 L62 Ibe TAS! 0.62 


* Significant at the .01 level. 


Denotational Information. Grade eleven students produced significantly more 
intensifiers than did either the teachers or the grade five sample (Table 2). They 
also produced more per utterance than did the other groups (Table 3). Of every 
100 utterances, 19 contained intensifiers compared to 5 out of 100, and 6 out of 
100 for the teachers and grade five students, respectively. Grade eleven students 
also produced more negatives than any other group as well as more per utterance. 
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The difference was significant from the grade five means but not from those of 
the teachers. It has been hypothesized (Fagan, 1979) that intensifiers and 
negatives may serve a function of focus in oral language and their use may be 
developmental. This hypothesis may also apply to written language (Cameron, 
1979). Examples are: “Parts of Edmonton were threatened by flood waters, but 
other places were in extremely worse positions,” “The debris that was deposited 
along the river banks was very high in many places,” “Obstacles in the river were 
terribly dangerous to rescuers,” “The water causes millions of dollars in damage 
and destroys heirlooms which can never be replaced,” “No low-lying town should 
be without some kind of protection.” 


TABLE 2 


MEANS AND STANDARD DEVIATIONS 
FOR INTENSIFIERS AND NEGATIVES 


Means Seis F-ratio 

Intensifiers 

Teachers Lil: 1,43 

Grade 5 Leo oS 

Grade ll USTED) RIS 13.64% 
Negatives 

Teachers 92. e220 

Grade 5 Qe 55 103 

Grade 1l PaO) ke 52 10.00* 
* Significant at the .01 level. 

TABLE 3 


PROPORTIONAL MEANS AND STANDARD DEVIATIONS 
FOR INTENSIFIERS AND NEGATIVES 


Means SoDr F-ratio 

Intensifiers 

Teachers HONS; .04 

Grade 5 06 09 

Grade 11 19 gal? 9.44% 
Negatives 

Teachers 07 -02 

Grade 5 elie .07 

Grade ll a0) .04 9.20% 


* Significant at the .01 level. 
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Syntactic Structures. Teachers and grade eleven students differed significantly 
from grade five students on all syntactic measures for total amounts produced 
(Table 4). The grade eleven students differed from the teachers by producing 
significantly more alternate syntactic structures and relativization structures. A 
previous study on oral language development by the author (Fagan, 1979) showed 
that when speakers embed alternate structures within base structures they tend to 
accomplish this by using more of the relativization type structures. This same 
phenomenon appears to be happening in written language as well. 


TABLE 4 


MEANS AND STANDARD DEVIATIONS 
FOR SYNTACTIC STRUCTURES 


Means See F-ratio 

Basic structure 

Teachers DEP Al seal tt 

Grade 5 11.90 4.94 

Grade ll RS oS 10.80 16.19% 
Alternate syntactic structure 

Teachers 68.35 320i 

Grade 5 PBT} SYS) 1383 

Grade 11 84.62 SHO) fet i/ i 5 kes 
Relativization 

Teachers 34.00 Sie 

Grade 5 iM tshS 6 (5) 

Grade ll 50.50 PTET KG) 18.83% 
Ellipsis 

Teachers Deel he dls 

Grade 5 95 Tas’ 

Grade ll BS) ASI UE 8.36% 
Complementation 

Teachers 10.64 [as 5 Die! 

Grade 5 4.70 8201 

Grade 11 ORS WH One 8.08% 


* Significant at the .01 level. 


Whereas grade eleven students differed from teachers on the total number of 
alternate syntactic structures and the subcategory, relativization structures, this 
appears*to be accounted for by the amount of material written, since teachers and 
grade eleven students did not differ on these measures per utterance, although 
both groups were still significantly different from grade five pupils (Table 5). The 
fact that none of the groups differed significantly on the use of structures of the 
ellipsis or complementation type lends more support to the belief that more 
alternate structures are embedded in base structures at older age levels by using 
structures of the relativization type (clauses, adjectives, genitives, etc.) 
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TABLE 5 


PROPORTIONAL MEANS AND STANDARD DEVIATIONS 
FOR SYNTACTIC STRUCTURES 


Means SD F-ratio 

Alternate syntactic structures 

Teachers PLS TPP arent 

Grade 5 1.97 . 84 

Grade ll 2E85 oes) hails 
Relativization 

Teachers ils Shi, 46 

Grade 5 .83 the 

Grade 1l 1.48 58 8.44% 
Ellipsis 

Teachers sS)7 AoW 

Grade 5 82 44 

Grade 11 82 areal: 0.82 
Complementation 

Teachers 42 ails} 

Grade 5 ais} 5A, 

Grade 11 736 5 AUS) 0.98 


* Significant at the .01 level. 


Textual Information. Significant differences among the three groups occurred 
on all but one of the textual information variables (Group II Referential 
Connectives — class inclusion, inclusion, derivation, etc.). In all cases, the teachers 
and grade eleven students differed significantly from the grade five group. In two 
cases, the grade eleven students and teachers differed significantly — the former 
used significantly more Referential Connectives (in total) and more of the Group I 
category (pronouns, synonyms, repetitions) (Table 6). 

When textual information was analysed, not in terms of total amounts 
produced, but in terms of amounts proportionate to the utterance (Table 7), a 
somewhat different picture emerges. Grade eleven students differed significantly 
from teachers and the grade five group, and used significantly more information 
from Referential Connectives from Group I of this category and from Group III of 
the Logical Connectives. With respect to Referential Connectives, the grade eleven 
students used approximately 20 for every 10 utterances, whereas teachers and the 
grade five pupils used 15. It may be considered unusual that the grade eleven 
students, who tended to use more information in most categories, also used more 
of the information from subcategory III. This includes the conjunctions and and 
and then, which usually decline in use over age or grade levels (Fagan, 1978; 
Fagan, 1979; Cameron, 1979; Loban, 1976). Their decline in use is usually 
associated with their function of joining or stringing sentences or t-units together 
to the exclusion of other connectives. However, an analysis of the grade eleven 
protocols shows a different function. They were used mainly in the syntactic 
structures within the Ellipsis category, i.e., to reduce the number of words in 
conveying particular information. For example, 
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As the rains continue, and the river is swamped with excess water and stirred 
up debris, the only place this water can go is over the banks and into the 
surrounding areas. This causes the flooding of local SEONG Ge) fields, and 
very often the destruction of homes. 


Since grade eleven students used more Ellipsis structures than the other groups 
(Table 4), this appears to be an adequate explanation. 


The grade five pupils differed from the grade eleven students and from the 
teachers in their use per utterance of Group II Referential Connectives, and 
Logical Connectives. In the case of the former, they used significantly fewer. The 
latter finding is not unusual since the two older groups used more Logical 
connectives. However, Group II Referential Connectives (inclusion, class inclusion, 
derivation, formal repetition) tend to be used more sparingly and are considered 
more difficult than the connectives from Group I of this category. An analysis of 
grade five protocols showed that instances of inclusion were used more frequently 
from this subcategory. There were no instances of derivation and only very few 
instances of the other two connectives. Examples of the use of inclusion are: “. . .it 
was a frightening experience,” “. . .when it rains,” “When the river is really deeply 
gorged, it produces a young river.” 


TABLE 6 


MEANS AND STANDARD DEVIATIONS FOR TEXTUAL INFORMATION 


Means Sale F-ratio 

Referential Connectives 

Teachers ANG) FEAL DRM 

Grade 5 PME) 10.06 

Grade 11 59.00 23602 16.41* 
Group I 

Teachers Sys Tea 26-33 

Grade 5 IAS, SiS 9.78 

Grade 11 We tly ESV extehe! 16.07* 
Group II 

Teachers Wie GZ Silez 

Grade 5 7.90 Bree. 

Grade 11 Saleen Loh 0.19 
Logical Connectives 

Teachers HED Tf eh als 

Grade 5 The gH} 6.46 

Grade 11 26.87 eA 12.78% 
Group III 

Teachers 11.64 6.42 

Grade 5 7.00 4.63 

Grade 11 Gia Tete IAL Ser 
Group IV 

Teachers ia, al Tey 

Grade 5 3.90 BRS 

Grade 11 Ova, 705 7.98% 


a ee ne Sas ee 
i aaa A pa ge cg ee eee ee ee ee 


* Sieniticanteate the .O0l level. 
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TABLE 7 


Means -D. Beato 

Referential Connectives 

Teachers ig Sy ei 

Grade 5 156 <2. 

Grade 1l 2.08 39 6.60%** 
Group I 

Teachers ily Ad. Seif 

Grade 5 95 5) 

Grade 11 1.84 49 15.40** 
Group II 

Teachers =35 SANG 

Grade 5 Ai oy) 

Grade ll =33 pull, 6.16%** 
Logical Connectives 

Teachers 89 26 

Grade 5 WSs 28 

Grade ll 96 Pall 2.94% 
Group IIl 

Teachers sy, n2i2. 

Grade 5 745 50) 

Grade 11 Alay? ey 2.88% 
Group IV 

Teachers 42 18 

Grade 5 26 19 

Grade 11 35 2a 2.65 


* Significant at the .05 level. 
**k Significant at the .01 level. 


Summary of Findings 


Figure 1 can perhaps best provide an overall picture of the differences among 
the three groups on the variables tested. 


An examination of the data in Figure 1 shows that there was much more 
similarity between the grade eleven students and the teachers than between the 
grade five pupils and either of these groups. This finding is more obvious for the 
use of total information than for the use of information proportionate to the 
number of utterances. When differences did occur between the teachers and the 
grade eleven students, the latter always used more of the information being 
assessed. Only in one instance (Group II Referential Connectives proportionate to 
the number of utterances) did the grade five group use more information than the 


grade eleven students or the teachers. 
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Total Information Proportion/Utterance 


Variables Tested 


Number of variables 


Number of variables 
Grade 11 - Grade 5 
T - Grade 5 

Gralie— Gry 5 


* Grade 11 and grade 5 differed on fourteen variables when total information was analysed. 


Figure 1. Comparisons of differences among groups on variables tested. 


Discussion and Implications 


Does written language develop in individuals over the years from the grade five 
to the grade eleven level and to the level of practising teacher? In terms of the use 
of various kinds of linguistic information, there appears to be a definite 
development between students when they are in the elementary grades and when 
they are in high school. Some kinds of linguistic information do not seem to occur 
to any great extent in written language. As early as grade five, students have 
learned that an utterance in written language does not have the same degrees of 
freedom as utterances may have in spoken language (Fagan, 1978, 1979). The 
incomplete t-unit was rarely used and did not distinguish the three groups. An 
analysis of the data for correction/edit type mazes produced too few for analysis. 
Elementary school students and older subjects have also learned that this is not a 
written language phenomenon. 


A key question still needs to be asked — that is, does use of more of a 
particular kind of information suggest a maturity or complexity in development? 
Information, in and of itself, may not necessarily lead to such a conclusion. The 
nature of the information and its function (examples of its use) must be 
considered. Grade eleven students, for example, not only used more alternate 
syntactic structures per utterance, but also proportionately more referential 
connectives. The former information suggests more complex utterances, whereas 
the latter suggests one type of information likely used to embed these structures 
into more complex units. 


It has been pointed out that this study was restricted to an analysis of certain 
linguistic information (as defined by the Semantic Potential Theory of Language). 
However, other kinds of information used deserve some comment. 

The grade eleven sample (only) used headings to organize what they wrote 
(Causes, Results of Flooding), and diagrams to illustrate certain points. Diagrams 


228 


Longitudinal Study of Written Language 


were used to illustrate such points as “Farmers often plough at wrong angles to 
the low point of their land, and the water flows through these plough rivelets” or 
“A mature river is sloped like.” One function of such information is to act as 
advance organizers; that is, they allow the reader to build up a framework or 
knowledge structure before reading, which may be used during the course of 
reading to better synthesize the information presented in running text 
(Karahalios, Tonjes & Towner, 1979). 


Before one draws the conclusion that the grade eleven students are superior to 
the teachers in their production of written language, other factors must be 
considered. The audience is one such factor. A study by Crowhurst and Piche 
(1979) showed that when grade ten students wrote in the argument mode, the 
syntactic complexity differed depending on whether the audience was a teacher or 
a best friend. The teachers in the present study were instructed to write for grade 
five pupils, whereas the grade eleven students wrote for their peers. However, this 
must be tempered by the comment that since the teachers differed significantly on 
most variables from the grade five group and did not use aids like headings and 
diagrams, their writing may not be most suitable for their audience. One 
conclusion that can be drawn is that by the time grade five pupils reach grade 
eleven, their manner of writing is significantly different. Other data are needed 
before it can be stated whether this standard is adequate for grade eleven. 


The interweaving of thoughts and their expression have been considered an 
indicator of language maturity (Fagan, 1978; Hunt, 1976; Loban, 1976; O’Donnell, 
Griffin, and Norris, 1967) Proponents of discourse analysis (Drum, 1974; 
Frederiksen, 1975; Grimes, 1972) advocate that interweaving of thoughts must be 
considered at a level greater than the sentence. This study and the Semantic 
Potential Theory on which it is based point up types of linguistic information 
which may be used to achieve this end. 
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Low Achievers’ Confidence 
in Their Computational Algorithms 


Data were collected from students in Grades 5 through 8 to document the 
existence of inappropriately high levels of confidence in incorrect computational 
methods among low achievers. The data consisted of a score on a computational 
skills test and a score on an algorithm confidence item for each of the four basic 
operations with whole numbers. The percentage of low achievers classified as 
having high algorithm confidence was: addition, 85%; subtraction, 79%; 
multiplication, 70%; and division, 52%. A second sample of low achievers was 
interviewed and the subjects were able to state justifications for their high 
levels of algorithm confidence. Possible explanations and implications of the 
results are discussed. 


A substantial number of students fail to master the basic computational skills 
required of them in mathematics classes in elementary school. While this situation 
is not new, there appears to be renewed interest among _ teachers, 
teacher-educators, and educational researchers in investigating both the causes of 
students’ failure to learn these procedures as well as in the development of various 
aspects of the remedial process. 


One possible parameter of the remedial process concerns the student’s degree 
of confidence in his ability to perform the operation in question. That is, a low 
achiever’s degree of confidence in his ability to apply correctly one or more of the 
computational algorithms, whether that degree of confidence is too high or too 
low, may be an important factor in determining the eventual success or failure of 
a remedial program. 
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Prescriptive Mathematics. Both men are on the contract team for the 1981 B.C. Mathematics Assessment. 
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Singer (1977), in his review of research concerning instructional strategies for 
psychomotor skills, cites findings which suggest that “the more confidence an 
individual places in a response, the more difficult it will be to unlearn if it is 
subsequently shown to be incorrect.” Macraw and Holding (Taylor, 1970) conclude 
from their research that “the degree of confidence or conviction with which a 
response is selected or performed may be a basic variable in this process 
(subsequent learning)” (p. 198). 

Erlwanger (1974), in his study of six children’s conceptions of mathematics, 
found that “in general the children’s conceptions underlying their behavior appear 
to be stable, cohesive, conceptual systems” (p. 277). One of the children, Benny, 
expressed the belief that the answer to an exercise depended upon the method 
employed but that the different answers he might obtain were equivalent and, 
therefore, equally correct. 

The authors’ clinical experience with students whose computational skills were 
weak has shown that such students frequently profess a high degree of confidence 
in their computational methods, even though those methods may result in 
incorrect responses. The authors’ experience has also shown that such students 
frequently have developed incorrect, idiosyncratic algorithms which they apply 
with a high degree of consistency. 

An example of one such idiosyncratic algorithm is shown below. In this 
subtraction algorithm, the student simply finds the absolute difference of each 
pair of digits, regardless of the relative positions of the digits in the exercise. 


8173 
-6839 
2746 


The study reported in this paper was designed to seek evidence regarding 
students’ expressed degree of confidence in their use of the computational 
algorithms for whole numbers. In particular, the purpose of the study was to 
document the existence of inappropriately high levels of confidence in incorrect 
computational methods among low achievers. 


Method 


All of the students in Grades 5 to 8 in a suburban Vancouver, British 
Columbia school district participated in the first phase of the study. Of the 5700 
students who were present at the time the testing was done, 5 440 (95.4%) 
provided complete data. The other 260 cases were not used as part of the analysis. 

Investigator-developed instruments were utilized for both the measure of 
algorithm confidence and the measure of computational ability. The former 
consisted of four Likert-type items, one for each of the whole number operations 
of addition, subtraction, multiplication, and division (See Figure 1 for an 
example). Each response was given a score from 1 to 5, with the higher scores 
indicating higher levels of algorithm confidence. The Hoyt estimate of reliability 
for the algorithm confidence instrument was 0.83. 

It should be noted that the algorithm confidence items required students to 
evaluate the correctness of their computational methods rather than the 
correctness of their answers. In other words, the study represented an attempt to 
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address the process utilized rather than the product or result obtained. While a 
case might be made for either alternative, the process information would appear to 
have a greater likelihood of providing significant information for planning any 
remedial work which might be required than would the corresponding product 
information. Moreover, there is support in the literature (Radatz, 1979) as well as 
in the analysis of students’ errors in the project in which the data were collected 
for the present study to support the contention that a significant proportion of 
students’ errors in computation are both consistent and process-related. 


How sure are you that your way of ADDING is correct? 


. L'm positive my way is correct. 
. I'm pretty sure my way is correct. 
. I don't know if my way is correct. 


. I'm pretty sure my way is wrong. 


Ret te OY 


. I'm positive my way is wrong. 


Figure 1. Algorithm confidence item for addition. 


The 44-item test of computational skill was administered immediately after the 
algorithm confidence instrument. The test consisted of four subtests, one for each 
of the four basic operations with whole numbers. The division subtest consisted of 
8 items while each of the other three contained 12 items. Items for each subtest 
were selected to correspond to the various levels of generality of application of the 
algorithm involved. For example, the division items ranged from an example 
involving a multi-digit divisor with zero in the quotient and a remainder, to one 
with a one-digit divisor and no remainder. The computation test was pilot-tested 
in a small number of classes to verify the appropriateness of the items and to 
ascertain the amount of time needed by students to complete the test. The 
computation test was administered after the confidence instrument since it was 
felt that otherwise the subjects’ responses might be unduly influenced by their 
perceptions of their ability to obtain the correct answers. 


Students were given tables of addition and multiplication basic facts to use 
during the test. This was done in an attempt to minimize the number of errors 
caused by lack of mastery of basic facts, thereby making it possible to concentrate 
upon errors due to lack of mastery of the processes involved. No time limit was 
imposed for completion of the test. The Hoyt estimates of reliability for the 
computation tests where as follows: addition subtest, 0.80; subtraction, 0.76; 
multiplication, 0.78; division, 0.84; total test, 0.88. All of the tests were scored by 
hand as part of a larger project designed to catalogue the different categories of 
computational errors made by students. 


For a given operation, a low achiever was defined to be a student whose score 
on that subtest was more than one standard deviation below the mean score for 
that grade. A student was included in the high algorithm confidence group for a 
given operation if he selected either of the positive responses (1 or 2) on the 
corresponding confidence item. Those who selected either of the negative 
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responses (4 or 5) on that item were classified as members of the low algorithm 
confidence group for that operation. 


The algorithm confidence data were collected by means of a self-report 
instrument, and Ziller, Hagey and Smith (1969) have commented on the difficulty 
of obtaining valid responses using such a method. In order to verify the validity of 
the algorithm confidence data, the algorithm confidence instrument and the 
achievement test were also administered to all of the students in Grades 5 to 7 (N 
= 126) from a school in another school district. Of those who took the tests, 19 
(15.1%) were selected to be interviewed about their self-report responses. The 
criteria used to select students for follow-up interviews were that the students had 
stated that they were “positive” their method of performing one of the operations 
was correct, and that they had been categorized as low achievers on the 
corresponding achievement subtest. One week later, an interviewer readministered 
the algorithm confidence instrument to those students to be interviewed and then, 
without informing them of their achievement subtest results, questioned them 
about their response on the self-report instrument. One of the 19 students was 
absent on the day of the interviews and the first three interviews were used to 
standardize procedures. The remaining 15 subjects formed the interview sample. A 
more detailed discussion of the algorithm confidence validation phase of the 
project is contained in Feghali (1976). 


Results 


In Table 1, summary statistics for the achievement subtests are presented. 
Since computational skills are taught for mastery, it is to be expected that the 
results would be as good as they are. Also, as expected, the results show that 
students’ performance in whole number computation improves with each grade 
and that they have more difficulty with division than with any of the other three 
basic operations with whole numbers. 


TABLE 1 


SUMMARY STATISTICS FOR ACHIEVEMENT TESTS 


No. of ‘Addition Subtraction Multiplication Division 
Coase Students Mean SDs Mean Se Di Mean S.D. Mean Die Die 
5) 1 360 10 ils 7 TONG Bos all BSS) 4.1 Qo 
6 1 474 ete 2 IEE 10.4 0) 10.0 Pee Bip) 2.4 
i 1 411 lth ss} VWewlan IOs thehs, MCs 5) ibe’, Sys) Lied 
8 iL RSs 17.4 1.4 10.8 ir. & ONG IAS) 6.0 eal 


Table 2 contains the numbers of low achievers, by grade and by operation. As 
with the data in Table 1, they reflect the generally increasing difficulty of the 
operations from addition, to subtraction, to multiplication, to division. They also 
show that the size of the low achieving group is small when compared to the total 
population. 
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TABLE 2 


FREQUENCY DISTRIBUTION OF LOW ACHIEVERS 
BY GRADE AND BY OPERATION 


Grade Addition Subtraction Multiplication Division Total 


5 ae? 174 236 334 856 

6 74 207 156 250 687 

7 49 170 154 238 611 

8 Sy 183 189 163 567 
Total 267 734 735 985 


A summary of the algorithm confidence data is presented in Table 3, in terms 
of percents. Students who were high achievers were predominantly highly 
confident, as was to be expected. However, since it is the inappropriately high 
confidence levels of low achievers that is the focus of this paper, only data for low 
achievers is presented in Table 3. Students who declared themselves to be either 
“positive” or “pretty sure” that their method was correct were classified as having 
a high algorithm confidence (HAC). Those who said they were either “pretty sure” 
or “positive” their method was not correct were classified as having low algorithm 
confidence (LAC). 


TABLE 3 


ALGORITHM CONFIDENCE RESULTS FOR LOW ACHIEVERS (PERCENT) 


Grade Addition) — Subtraction Multiplication Division 
LAC@ HAC LAC _HAC / LAC _____ HAC LAC HAC 
> 2 77 6 61 13 54 27 31 
2 : 91 2 74 7 68 20 42 
? 2 92 2 89 3 81 AW) 59 
8 s 88 2 91 1 82 14 56 


LAC: Low Algorithm Confidence; HAC: High Algorithm Confidence 


In all 16 results (4 grades x 4 operations), more than 50% of low achievers 
have high algorithm confidence ratings. It is clear that low achievers in arithmetic 
computation with whole numbers have unrealistically high levels of algorithm 
confidence. In fact, in most grades and for most operations, this is true of the 
overwhelming majority of these low achievers. 


The percentages of low achievers who expressed high algorithm confidence are 
all high for the operations of addition, subtraction, and multiplication. Only in the 
case of division did a substantial proportion in each grade express a low degree of 
confidence in their computational method. Further analysis of these data 
indicated that approximately one out of six of the low achievers in division had 
not attempted any of the division exercises. Since the pilot testing of the 
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instruments had shown that virtually all students had sufficient time in which to 
complete the test, it is unlikely that these low achievers failed to complete the 
division items because of a lack of time. It seems more appropriate to conclude 
that these students had no idea how to do the division exercises and, as a result, it 
is not surprising that they would have low levels of algorithm confidence. Even for 
division, however, the ratio of HAC subjects to LAC subjects was 5 to 2. 


The data in Table 3 also show that, within each grade, there is a tendency for 
the percentage of low achievers with high algorithm confidence to decrease from 
addition, to subtraction, to multiplication, to division. On the one hand, this may 
be a reflection of the increasing difficulty of the algorithms for these operations, 
and the results in Table 1 would lend support to such a view. On the other hand, 
since the operations are customarily taught in the order listed above, these results 
may be interpreted to mean that there is a direct relationship between the length 
of time a low achiever has worked with an algorithm, even an incorrect one, and 
that student’s algorithm confidence. This latter view is supported by the fact that, 
for each operation, the data show that the proportion of low achievers having high 
algorithm confidence scores generally tends to increase from grade to grade. 


The final set of results concerns the 15 students who were interviewed to 
determine what low achievers meant by their high algorithm confidence responses. 
The data in Table 4 show that 11 of the 15 low achievers interviewed were 
“positive,” on both testing occasions, that their method of performing the 
operation in question was correct. The other four changed from being “positive” to 
“pretty sure.” None of them switched to a negative (scores of 4 and 5) or 
inconclusive (score of 3) algorithm confidence category. 


TABLE 4 


DATA ON INTERVIEWEES 


Achievement Test Algorithm Confidence Scores 

Subject Grade Operation Rr ee oa Sik ccttaals ae ac 
I 5) x 66.7 5 4 
2 6 + al/52) 5) 5 
3 6 + 62.5 > 5 
4 6 + (258 5 5 
5 6 + DS) 5 5) 
6 6 + 50 5) 4 
7 6 - 1) 5 5 
8 6 x US) 5 D 
sy 6 x 66.7 5 5 
10 7 x 41.7 5 a) 
ial 7 + Bie 5 4 
2, 7 x 50 5) 4 
is ii = 79 3) 5 
14 i - 66.7 5 5 
IS) Ii - 75 5 5 
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During subsequent interviews, the four students who had changed their 
algorithm confidence response were asked to give reasons for the change. All of 
them said that in the week between the two administrations of the confidence 
instrument they had either had time to rethink their answer or had received some 
new evidence upon which to base their opinions. 


All fifteen of the interviewees were asked the following question: “What makes 
you so sure your way of adding (subtracting, multiplying, or dividing) is correct?” 
Although some students gave more than one reason, the results may be 
summarized as follows. Five students declared that they used the method they 
had been taught; four stated that teachers, peers, or textbooks used the method 
they did; four said that some or most of their answers were correct; two said their 
method usually “worked out”; two others claimed they had been using the same 
method for a long time; one simply stated that he knew how to do it; and one said 
that his way was the only correct one. In summary, the interviews showed that 
these students really believed their computational methods were correct, in spite 
of their low achievement levels. The interviews support the view that the data 
obtained on the self-report instrument were valid. 


Discussion 


The results obtained in this study should be of interest to educators who are in 
some way involved in the design or delivery of remedial programs in elementary 
mathematics. Briefly, the results show that low achievers in computation with 
whole numbers frequently profess a high degree of confidence in their ability to 
perform such tasks. This situation raises the question of the significance of a 
student’s algorithm confidence level for subsequent remedial work. Future studies 
might be designed, for example, to investigate the extent to which students’ 
inappropriately high algorithm confidence levels mediate against the success of 
remedial measures. 


One possible explanation of the low achievers’ high algorithm confidence 
ratings may be that the students’ self-expectations are not as high as their 
teachers’ expectations of them. Generally speaking, teachers expect students to 
exhibit a high degree of mastery of computational skills with whole numbers; 
however, many of the low achieving students who were interviewed seemed rather 
content with their performances. Six of the fifteen interviewees stated explicitly 
that their algorithm confidence level was high because they usually obtained a 
satisfactory (to them) proportion of correct answers, in this case about 50%. 
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Predictive Validity of an Individualised 
Classroom Environment Questionnaire 


A strong research tradition in numerous countries has established the predictive 
validity of student perceptions of classroom learning environment in accounting 
for appreciable amounts of variance in cognitive and affective outcomes, often 
beyond that attributable to corresponding pretest performance. The present 
study explored the predictive validity of a new instrument, the Individualised 
Classroom Environment Questionnaire (ICEQ), which measures perceptions of 
five dimensions (Personalisation, Participation, Independence, Investigation, 
and Differentiation) which distinguish individualised classrooms from 
conventional ones. A sample of 320 Grade 7 to 9 students in 14 science 
classrooms provided responses to the ICEQ at mid-year and to seven 
science-related attitude scales, both at the beginning and end of the year. 
Simple, multiple, semipartial multiple, and canonical correlation analyses 
provided consistently strong support for the predictive validity of the ICEQ. 


Over the previous ten to fifteen years, considerable interest has been shown 
internationally in the conceptualisation, measurement and investigation of student 
perceptions of psychosocial characteristics of their classroom learning environment 
(see Anderson & Walberg, 1974; Fraser, 1980a, 1980b; Fraser & Walberg, in press; 
Haertel, Walberg & Haertel, 1979; Moos, 1979; Walberg, 1976, 1979), Although the 
largest proportion of this work has been carried out in the United States, a 
significant amount of research has also been conducted in Canada, Australia, and 
Israel, and a small number of key studies have been done in developing countries 
including India, Indonesia, Brazil, and Thailand. Furthermore, several key studies 
in the area have been reported in this journal (Anderson & Walberg, 1972; 
O’Reilly, 1975; Walberg & Singh, 1974). 


Barry J. Fraser (PhD, Monash University, Melbourne) is a Senior Lecturer in Education at Macquarie University, Sydney. His 
research interests include learning environments, educational evaluation, and science education. He is author of the Pergamon 
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The two most widely used instruments in prior research are the Learning 
Environment Inventory (LEI) (Anderson & Walberg, 1974, 1976) and the 
Classroom Environment Scale (CES) (Moos & Trickett, 1974; Trickett & Moos, 
1973). The LEI was developed originally as part of the research and evaluation 
activities cf Harvard Project Physics, whereas the development of the CES grew 
out of a programme of research in other milieus including psychiatric hospital 
wards, university residences, families, correctional facilities and therapeutic groups 
(Moos, 1976). Examples of the dimensions measured by these instruments are 
Cohesiveness, Friction, Speed, Difficulty, Democracy, Affiliation, Competition, and 
Innovation. Nevertheless, despite wide usage of the LEI and CES, these 
instruments have the shortcoming that they neglect dimensions that are salient in 
classroom settings variously referred to as individualised, open, or inquiry-based. 
Consequently, Rentoul and Fraser recently developed a new instrument called the 
Individualised Classroom Environment Questionnaire (ICEQ) to measure student 
perceptions of five environmental dimensions which differentiate individualised 
classrooms from conventional ones (Rentoul & Fraser, 1979; Fraser, 1980a). The 
dimensions measured by the ICEQ are Personalisation, Participation, 
Independence, Investigation, and Differentiation. 

Some prior research has involved the use of classroom environment variables 
as dependent variables. This research has revealed that, when used in curriculum 
evaluation studies, classroom environment characteristics differentiated 
revealingly, usefully, and appreciably between classrooms following alternative 
curriculum materials or teaching strategies (Anderson, Walberg & Welch, 1969; 
Cort, 1979; Fraser, 1979; Levin, 1980). Other studies have established the criterion 
validity of classroom environment perceptions in differentiating between 
classrooms varying in class size (Walberg, 1969a; Anderson & Walberg, 1972), 
grade level (Welch, 1979) and subject matter (Anderson, 1971; Kuert, 1979). The 
strongest tradition in past research on classroom environment, however, has 
involved the predictive validity of student perceptions (i.e., ability to predict 
student cognitive and affective learning outcomes). Furthermore a large body of 
past predictive validity studies has provided convincing and consistent support for 
the predictive validity of student perceptions in accounting for appreciable 
amounts of variance in learning outcomes, often beyond that attributable to 
student characteristics such as pretest performance and general ability. 


One purpose of the present paper is to make the tradition of prior classroom 
environment research more accessible to wider audiences by providing a brief 
overview of past work. The primary purpose, however, is to report the first main 
study of ‘the predictive validity of the ICEQ, a new measure of environment 
dimensions salient in individualised settings. 


Past Predictive Validity Studies 


Past studies have involved a variety of subject areas and grade levels and have 
reflected numerous methodological variations including sample size, unit of 
statistical analysis, background variables controlled, and data analysis techniques. 
In general, however, these studies have involved the prediction of student learning 
outcomes measured at the end of a school year from student perceptions of the 
classroom environment at mid-year, often with statistical control for corresponding 
beginning-of-year learning outcome performance. 
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A lot of the earliest predictive validity studies involved the administration of 
the LEI to senior high school classes in the U.S. studying Harvard Project Physics 
materials. The predictive validity of student perceptions was supported in a study 
of individual student learning outcomes and individual student environmental 
perceptions (Walberg & Anderson, 1968), class mean outcomes and class mean 
environment perceptions (Anderson & Walberg, 1968), and individual outcomes 
and class mean environment perceptions (Anderson, 1970). Using a variety of 
units of analysis and other methodological variations, Walberg (1969b, 1969c, 
1972) further established the predictive validity of the LEI among senior high 
school physics classes in the U.S. 


Further support for the predictive validity of versions of the LEI in U.S. 
classrooms comes from Lawrenz’s (1976) study of 238 science classes and Cort’s 
study of 81 social studies classes. In Australia, a series of studies involving 
simplified versions of the LEI among junior high school science students 
demonstrated that classroom environment variables accounted for appreciable 
increments in variance in several cognitive and attitudinal measures beyond that 
attributable to corresponding pretest and general ability (Fraser, 1978a, 1979; 
Fisher & Fraser, 1980). In Canada, high school students’ perceptions on the LEI 
have been found to be significantly related to examination achievement in a study 
of eight different subject areas (Walberg & Anderson, 1972) and in a study of 
mathematics classes (O’Reilly, 1975). In Israel, Hofstein, Gluzman, Ben-Zvi and 
Samuel (1979) have established associations between Grade 11 chemistry students’ 
perceptions on the LEI and some attitudinal outcomes. 


Moos has used the CES with various samples of secondary school students in 
the U.S. to establish relationships between student perceptions of classroom 
environment and the outcomes of student satisfactions and moods (Trickett & 
Moos, 1974; Moos, 1978), student absences and grades (Moos & Moos, 1978), and 
student reactions (Moos, 1979). Brookover found that U.S. elementary school 
students’ perceptions of five characteristics of the school environment were 
associated with academic achievement (Brookover & Schneider, 1975; Brookover, 


Schweitzer, Schneider, Beady, Flood & Wisenbaker, 1978). McDill, Meyers and 
Rigsby (1967, 1969) used factor-analytic scales based on the College 
Characteristics Index and the High School Characteristics Index (Stern, 1970) 
with a sample of over 20 000 U.S. high school students to establish relationships 
between student perceptions and the outcomes of mathematics achievement and 
college plans. Scales based on the College Characteristics Index have also been 
used in Australia to establish relationships between student perceptions and 
several science-related attitudes among eleventh grade physics students (Gardner, 
1976). 


Studies conducted in a number of developing countries have supported the 
cross-cultural predictive validity of student perceptions of classroom learning 
environment. Walberg, Singh and Rasher’s (1977) study of 83 science and 67 social 
science classes at the tenth grade level in Rajasthan, India, revealed strong 
associations between perceptions on a Hindi version of the LEI and end-of-course 
achievement. In Brazil, Holsinger (1972, 1973) used a translated version of the 
LEI to show that classroom environment was a powerful predictor of the cognitive 
outcome of information learning and the noncognitive outcome of individual 
modernity among a sample of 2 533 Grade 3 to 5 students. Chatiyanonda (1978) 
used a Thai version of the LEI with a sample of 31 twelfth grade physics classes 
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in Bangkok or nearby provinces to establish that student perceptions accounted 
for significant increments in the variance in some attitude outcomes beyond that 
attributable to beginning-of-year attitude scores. In East Java, Indonesia, Paige 
(1978) used an environment instrument based on the LEI and CES to provide 
evidence supporting the predictability of achievement and modernity from the 
classroom perceptions of 60 sixth-grade classes. 


A recent meta-analysis (Haertel, Walberg & Haertel, 1979) provides additional 
strong support for the predictive validity of students’ classroom environment 
perceptions. This research synthesis involved analysis of 734 correlations obtained 
from 12 studies of 10 data sets in 8 subject areas, encompassing 17 805 students in 
823 classes in four nations. Learning outcomes were found to be positively 
associated with student-perceived cohesiveness, satisfaction, task difficulty, 
formality, goal direction, democracy, and material environment, and negatively 
associated with friction, cliqueness, apathy, and disorganisation. Furthermore, the 
strengths of the outcome-environment relationships were studied as a function of 
various substantive and methodological characteristics of the studies from which 
the correlations were taken. It was concluded that the magnitude of the 
outcome-environment correlation was significantly associated with the dimension 
of classroom environment considered, the unit of statistical analysis, and the 
nation in which the study was ‘conducted. Correlation size, however, did not 
depend upon the sample size, subject matter, type of learning outcome, or 
presence of control for pretest and general ability. 


While there is plenty of evidence from past research to support the predictive 
validity of various classroom environment instruments, especially the LEI and 
CES, only a single study of the ICEQ’s predictive validity has been undertaken 
previously. This study (Rentoul & Fraser, 1980), which involved a sample of 285 
science and social science students in 15 junior high school classes in Australia, 
revealed significant associations between student perceptions on the ICEQ and 
affective (but not cognitive) outcomes. In view of these results, the present study 
was instigated to explore further relationships between perceptions of the ICEQ 
and a range of attitudinal outcomes. Before discussing the design and results of 
the study itself, attention in the next section focusses briefly on the 
conceptualisation, development, and validation of the ICEQ. 


Development of ICEQ 


The initial development of the ICEQ (Individualised Classroom Environment 
Questionnaire), which has been discussed in detail in Rentoul and Fraser (1979) 
and Fraser (1980a), was guided by the following criteria: 

1. Dimensions chosen characterised the classroom learning environment 
described in the literature of individualised education, including open and 
inquiry-based classrooms (Elliott & Adelman, 1975; Rathbone, 1971; Walberg 
& Thomas, 1972; Weisgerber, 1971). 

2. Dimensions chosen provided coverage of the three general categories of 
dimensions delineated by Moos (Insel & Moos, 1974; Moos, 1974) for 
conceptualising human environments. These three general categories are 
Relationship Dimensions (nature and intensity of personal relationships), 
Personal Development Dimensions (basic directions along which personal 
growth and self-enhancement tend to occur) and System Maintenance and 
System Change Dimensions (extent to which the environment is orderly, 
clear in expectation, maintains control, and is responsive to change). 
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8 Dimensions chosen and individual questionnaire items were considered 
salient and suitable by a group of educational researchers, practising 
teachers, and students. 

Extensive literature analysis and interviewing of teachers and students 
revealed that the above criteria could be satisfied with the following five 
dimensions: Personalisation, Participation, Independence, Investigation, and 
Differentiation. Items measuring each dimension were written and later modified 
after receiving reactions from researchers, teachers, and students. The scales were 
further refined to form a final version by application of item analysis techniques 
to data collected from samples of students (see Rentoul & Fraser, 1979). 


The final version of the ICEQ contains 50 items, with each of the five 
dimensions being assessed by 10 items. Each item is scored on a five-point scale 
with the responses of Almost Never, Seldom, Sometimes, Often, and Very Often. 
The scoring direction is reversed for approximately half of the items. Table 1 
clarifies the meaning of each ICEQ scale by providing its classification according 
to Moos’ scheme and a sample item from each scale. 


TABLE 1 


DESCRIPTIVE INFORMATION AND VALIDATION DATA 
(ALPHA RELIABILITY, MEAN CORRELATION WITH OTHER SCALES, 
ANOVA RESULTS FOR CLASS MEMBERSHIP DIFFERENCES) FOR ICEQ 


ania Descriptive Information Seale Statistics ANOVA Results 
Mean 
Moos! Correlation 
General Sample Item Alpha with other ip Eta 
Category Rel. Scales 
Personalisation Relationship The teacher considers 0.79 0.28 5.0% Layette) 


students! feelings. (+) 


Participation Relationship The teacher lectures 0.70 Om27 4.2% Ome 
without students asking or 
answering questions. (-) 


Independence Personal Students choose their Os 71 0.09 10.4% 0.33 
Development partners for group work. (+) 

Investigation Personal Students find out answers 0.69 0.16 3.9% ORS 
Development to questions and problems 


from the teacher rather than 
from investigations. (-) 


Differentiation System Different students use 0.61 0.07 6.6* 0.24 
Maintenance different books, equipment, 
and materials. (+) 


Note. Items designated (+) are scored 1, 2, 3, 4, 5, respectively for the responses Almost Never, 
Seldom, Sometimes, Often, Very Often. 


Items designated (-) are scored in the reverse manner. 


Scale statistics were calculated using the class mean as the unit of analysis. 


kp< OL 


Table 1 also summarizes some important scale validation statistics which were 
calculated for the ICEQ using the sample of 34 Australian classes used in a 
previous study (see Fraser, 1980a). This sample consisted of 766 Grade 7 to 9 
students in 34 separate classes, each in a different government high school spread 
widely throughout Sydney suburbs and country areas of New South Wales. In 
particular, Table 1 provides information about scale internal consistency (the 
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extent to which a given scale measures the same dimension) and discriminant 
validity (the extent to which a given scale measures a unique dimension not 
measured by other scales in the instrument). While the alpha coefficient is used as 
an index of internal consistency, the mean correlation of a scale with the other 
four scales is used as an index of discriminant validity. Table 1 shows that the 
value of the alpha coefficient ranged from 0.61 to 0.79 for different scales, thus 
suggesting that each ICEQ scale has adequate internal consistency reliability. This 
table also shows that the value of the mean correlation of a scale with the other 
scales ranged from 0.07 to 0.28. This suggests that ICEQ scales possess 
satisfactory discriminant validity and that they measure distinct, although 
somewhat overlapping, aspects of classroom environment. 

Another desirable characteristic of any classroom environment instrument is 
that it be capable of differentiating between the perceptions of students in 
different classrooms. That is, students within the same class should perceive their 
class relatively similarly, while mean within-class perceptions should vary from 
classroom to classroom. This characteristic was explored for each ICEQ scale with 
the previous sample of 766 students by performing a one-way ANOVA, with class 
membership as the main effect and using the individual as the unit of analysis, to 
obtain information about the ratio of between-class to within-class variance. The 
results of these analyses are shown in Table 1, which indicates that each ICEQ 
scale differentiated significantly (p<.01) between classrooms. Furthermore the 
eta’ statistic, which is the ratio of between to total sums of squares (Cohen & 
Cohen, 1975), was calculated as an estimate of the amount of variance in ICEQ 
scores attributable to class membership. Table 1 shows that the proportion of 
variance accounted for by class membership ranged from 15 per cent for the 
Investigation scale to 33 per cent for the Independence scale. 


Design and Results of Study 


The present study of the predictive validity of the ICEQ involved a sample of 
320 Grade 7 to 9 students in 14 science classes, each with a different teacher. 
These classes were spread throughout six representative schools (two private and 
four government) in suburban areas of Sydney, Australia. Approximately equal 
numbers of boys and girls made up the sample. 

The student outcomes studied were a set of seven attitudinal criteria measured 
by the Test of Science-Related Attitudes (TOSRA) (Fraser, 1978b, 1980c). The 
seven scales are Social Implications of Science (attitude to the social benefits and 
problems which accompany scientific progress), Normality of Scientists 
(appreciation that scientists are normal people rather than the eccentrics often 
depicted in the mass media), Attitude to Inquiry (attitude to scientific 
experimentation as a way of obtaining information about the natural world), 
Adoption of Scientific Attitudes (e.g., open-mindedness, willingness to revise 
opinions in the light of new evidence), Enjoyment of Science Lessons, Leisure 
Interest in Science, and Career Interest in Science. Alpha reliability coefficients 
for TOSRA scales were reported to range from 0.67 to 0.92 for a sample of 1 337 
Australian students in Grades 7 to 10 (Fraser, 1980c). 

The sample of 320 students responded to TOSRA at the beginning of the 1979 
school year and again at the end of the same year. Student perceptions of 
classroom learning environment were measured with the ICEQ in the middle of 
the same year. This experimental design permitted examination of the 
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predictability of end-of-year attitudes from classroom environment perceptions 
when corresponding beginning-of-year attitude scores were controlled statistically. 


A notable feature of prior research into the predictive validity of students’ 
classroom environment perceptions is that a variety of data analytic techniques 
has been used in exploring outcome-environment relationships. In the present 
study, it was decided to analyse data and report results in five different ways 
which reflect emphases in previous studies. Results obtained using these five 
methods of analysis are recorded in Table 2 for the sample of 320 students. 


TABLE 2 


SIMPLE, MULTIPLE, SEMIPARTIAL MULTIPLE, AND CANONICAL 
CORRELATION BETWEEN ATTITUDE AND ENVIRONMENT DIMENSIONS 


— a = — 


Simple Correlation, r Multiple Semipartial 
S Se : : 
ae Pers Part Indep Invest Diff Pop elas ten Mol efpis a 
Correlation 
R sR 
Social Implications 0.31%** 0.24%** -0.01 0.25** -0.02 0.35%** 0.25** 
of Science 
Normality of Scientists 0.18** 0. 26** 0.10 0), 13% 0.04 0.30** 0.26%** 
Attitude to Inquiry 0.08 0.16** -0.08 0.02 -0.18%* 0.24%* 0.22** 
Adoption of 0.28%* 0.24%** -0.05 0.18** -0.09 0.30%** 0.23%* 
Scientific Attitudes 
Enjoyment of 0.32** 0.26%** 0.02 0.23%** -0.07 0.37%** 0.27%* 
Science Lessons 
Leisure Interest 0.29%** 0.24%* -0.04 0.19%** -0.13* 0.32%* 0.23** 
in Science 
Career Interest 0.27%** 0.23%* -0.02 0.21%** -0.04 0.30** 0.22%* 
in Science 
Canonical Correlation: R, = 0.43** Re = 0.40** 
(Standard raw (Standard 
posttest residual 
scores) posttest 
scores)@ 


@2 Involves adjustment for corresponding pretest. 


kp < .05 
#k p< .01 


The first and least complex analysis reported in Table 2 is a simple 
correlational analysis involving scores on the seven attitude posttests and the five 
ICEQ scales. Results shown in Table 2 indicate that 21 of the possible 35 
correlations were significantly greater than zero (p<.05), which is 12 times that 
expected by chance. 


Although the simple correlational analysis can provide useful information 
about relationships between particular attitudinal outcomes and particular 
environmental dimensions, the multiple correlation of the set of ICEQ scales with 
each attitude posttest can be considered more revealing. In particular, the 
multiple correlation provides a more parsimonious picture of the joint influence of 
correlated environment dimensions on outcomes, and reduces the experimentwise 
Type I error rate associated with the simple correlational analysis. Table 2 shows 
that the magnitude of the multiple correlation between scores on an attitude 
posttest and the set of ICEQ scales ranged from 0.24 for the Attitude to Inquiry 
scale to 0.37 for the Enjoyment of Science Lessons scale. Moreover these values 
were significantly greater than zero (p<.01) for each of the seven TOSRA scales. 
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It has been common in past predictive validity research to perform a 
conservative test of outcome-environment relationships by controlling statistically 
certain student characteristics, particularly performance on the parallel pretest. 
That is, for reasons of simplicity, learning environment dimensions have been 
considered useful predictors of learning outcomes only if they accounted for 
different variance from that attributable to well-established predictors like pretest. 
The last column of figures in Table 2 shows the value obtained for the semipartial 
multiple correlation between each outcome posttest and the set of five ICEQ 
scales when the effect of corresponding pretest was controlled. Although values of 
the semipartial multiple correlation (0.22 to 0.27 in Table 2) were somewhat 
smaller than the corresponding values of the multiple correlation, they were still 
found to be significantly greater than zero (p<.01) for each of the seven attitude 
scales. Furthermore, since the square of the semipartial multiple correlation 
represents the increment in explained variance (Cohen & Cohen, 1975), it can be 
seen that the set of ICEQ scales accounted for an increment of between 
approximately five and seven per cent of the variance in different outcome 
posttests over and above that attributable to corresponding pretests. 


Although the use of multiple correlations and semipartial multiple correlations 
overcomes the problem of collinearity between ICEQ scales, collinearity between 
outcome measures could still give rise to an inflated experimentwise Type I error 
rate. Canonical correlation analysis, however, can provide a parsimonious picture 
of relationships between a domain of correlated attitudinal outcomes and a 
domain of correlated environment characteristics. Consequently two canonical 
analyses were conducted using standard scores (i.e., scores defined as the number 
of standard deviations above or below the mean). The reason for using standard 
scores is that a more meaningful interpetation can be made of canonical weights if 
all variables in the analysis have been standardised. The first canonical analysis 
was analogous to the multiple correlational analysis in that it explored 
relationships between scores on the set of five environment scales and raw scores 
on the set of seven attitudinal posttest outcomes. The second canonical analysis 
was analogous to the semipartial multiple correlational analysis in that it explored 
relationships between scores on the set of five ICEQ scales and residual posttest 
scores (adjusted for corresponding pretest) for the set of seven attitudinal 
outcomes. 

Table 2 shows that each canonical analysis revealed one significant canonical 
relationship (p<.01). A significant canonical correlation of 0.43 was found between 
environment scales and raw posttest scores on the attitude scales, and a significant 
correlation of 0.40 was found between environment scales and residual posttest 
scores on the attitude scales. 

The five separate analyses reported in Table 2 each provides strong support 
for the predictive validity of the ICEQ. Each analysis, however, leads to a 
somewhat different interpretation of relationships between individual outcomes 
and individual environment dimensions. For example, the simple correlational 
analysis suggests that significantly more positive attitudes were expressed on all 
attitude scales except Attitude to Inquiry in classes perceived as having greater 
Personalisation, on all seven attitude scales in classes perceived as having greater 
Participation, on all attitude scales except Attitude to Inquiry in classes perceived 
as having greater Investigation, and on two attitude scales (Attitude to Inquiry 
and Leisure Interest in Science) in classes perceived as having less Differentiation. 
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The most sophisticated analysis — the canonical analysis of the set of ICEQ 
scores and the set of posttest attitude scores residualised for corresponding pretest 
— revealed a somewhat different picture. An examination of the canonical weights 
for this analysis revealed that, with corresponding pretest controlled, more 
favourable attitudes on the Social Implications of Science, Normality of Scientists, 
and Enjoyment of Science Lessons scales were found in classes perceived as being 
characterised by greater Personalisation, Participation, Independence, and 
Investigation. 


Conclusion 


This article has described a study of the predictive validity of student 
perceptions on a new instrument, the Individualised Classroom Environment 
Questionnaire (ICEQ), which measures those dimensions which differentiate 
individualised and conventional classrooms. The sample consisted of 320 junior 
high school science students, and the outcomes studied were seven science-related 
attitudes. Relationships between end-of-year attitude outcomes and the classroom 
environment dimensions were explored using five different methods of data 
analysis. These were a simple correlational analysis, a multiple correlational 
analysis involving the prediction of each attitude posttest from the set of ICEQ 
scales, a semipartial multiple correlational analysis involving the prediction of 
each attitude posttest from the set of ICEQ scales but with corresponding 
beginning-of-year attitude controlled, a canonical analysis involving ICEQ 
dimensions and raw scores on attitude posttests, and a canonical analysis involving 
ICEQ dimensions and residual attitude posttest scores adjusted for corresponding 
beginning-of-year attitude. Taken together, results from the five separate analyses 
provided strong and consistent support for the predictive validity of student 
perceptions of classroom environment as measured by the ICEQ. 


It is noteworthy that the results of the present study generally indicated a 
positive association between the degree of classroom individualisation and student 
attitudes. This general pattern of results is consistent with Horwitz’s (1979) recent 
review of open education research which showed that, of 25 attitude studies with 
statistically sigiificant results, attitudes were more favourable in open classrooms 
than in traditional classrooms in 23 of the studies. 


In addition to providing information about the predictive validity of the ICEQ, 
this paper also reported additional information supporting the ICEQ’s internal 
consistency, discriminant validity, and ability to differentiate between the 
perceptions of students in different classrooms. Although this study has furnished 
some evidence about the validity and potential usefulness of the ICEQ in 
educational research, it is desirable that other research workers further explore the 
reliability, predictive validity, and usefulness of the ICEQ with a variety of 
samples from different countries. 


A copy of the ICEQ and scoring instructions is obtainable from the author at School of Education, Macquarie University, North 
Ryde, N.S.W. 2113, Australia. 
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Identifying Learning Disabled Students 
with a Selected Psychoeducational Test Battery 


The following study was undertaken to determine the extent to which a 
commonly-used test battery in a local-urban school setting could identify a 
preselected group of learning disabled (LD) children. Using an ecological control 
sample approach, 236 studenis in grades four to six were tested. The 25 LD 
students were later matched at random with 25 Controls. A stepwise linear 
regression analysis was performed on the total sample, with 22 independent 
variables and “membership in LD vs. Control” as the dependent variable. A 
Hotelling’s T? analysis was conducted on the second group of matched LD and 
Control students. The regression analysis and T°’ results clearly indicated that 
the psychoeducational test battery was ineffective in predicting group 
membership, accounting for only 35% of the variance in prediction. Although 
both cognitive and noncognitive measures entered into the regression equation, it 
was concluded that a child’s membership in either LD or Control is primarily a 
function of how his teacher perceives him. 


The search for effective means of assessing learning disabled children is 
continuous and often frustrating. Few professionals feel totally confident in 
interpreting test data to reach the diagnosis “learning disabled.” Historically, 
diagnosis of learning disabilities has focused on a number of isolated variables. 


Mr. Fisk (MEd, University of Alberta) is currently a school psychologist in the County of Strathcona and a clinical supervisor 
in the University of Alberta School Psychology program. His research interests have focused on intervention and diagnostic 
techniques for children with learning problems. 


Dr. Janzen (PhD, University of Calgary) is currently Director of the School Psychology program at the University of Alberta 
and Professor in the Department of Educational Psychology. His area of specialization is school psychology. Dr. Janzen is the 
Western Canadian liaison for the International School Psychology Association and the International Council of Psychologists. 


252 


Identifying Learning Disabled Students 


Deficits and strengths within the child were identified and labelled. Task analysis 
has been used to determine level of skill; learning styles have been analyzed and 
matched with supposed teaching models; motivation, interests, and attitudes have 
been inventoried and plotted; attention has been given to the significance of 
parents, siblings, and the home environment. No one would question the 
importance of each of the variables assessed; however, rarely are they all given to 
reach a diagnosis. Taken singly they provide little information (Hardin, 1978). 


The problem faced by school psychologists is that they are often placed in a 
position where a request is made for a diagnosis of the presence or absence of a 
learning disability. One reassessment that must be made concerns the knowledge 
the LD specialists claim to have regarding identification. The special knowledge of 
these specialists, indeed, the special knowledge on which the entire field rests is the 
ability to diagnose the presence of learning disabilities in children. 


Much of the current debate revolves around the basic question of whether or 
not the field can back up its claims to knowledge and diagnosis with valid empirical 
evidence. The answer, says Coles (1978), rests in large part on the nature of the 
learning disabilitiés test battery — the set of tests used to determine the quantity 
and quality of learning disabilities in children. 

The central tests in a battery usually evaluate cognitive, affective, and 
psychoneurological function. Specifically, tests are chosen to assess perception, 
language, intelligence, self-concept, personality (e.g. locus of control), and school 
achievement areas. Although the tests are usually categorized as dealing with 
separate functions, they often overlap since they are all designed to fit (more or 
less) a model of learning disability based upon an accepted definition. An 
acceptable definition, and one used in this study is as follows: 


A learning disabled child is one who manifests an educationally significant discrepancy 
between his tested intellectual potential and actual level of performance. This 
discrepancy may be evidenced in the area of understanding or using language, spoken or 
written, which in turn may manifest itself in imperfect ability to listen, think, speak, 
read, write, spell, or do mathematical calculations. This discrepancy will not be caused 
by mental retardation, educational or cultural deprivation, severe emotional disturbance, 
or sensory loss. (County of Strathcona #20, 1978, p.12) 


It is apparent that this definition has its limitations. The purpose of a test battery 
is to determine whether or not a dysfunction (neurological or otherwise) is impeding 
the child’s ability to learn under otherwise normal conditions, a dysfunction that 
can exclude the clauses listed in the definition. 

An individual LD test battery will, of course, vary according to problems a child 
is suspected of having, a school psychologists’ preference for particular tests, and 
the available resources of a school system, agency or clinic. A recent review by Coles 
(1978) of the ten most frequently recommended tests and evaluations (ITPA, 
Bender, Frostig, Wepman, Lincoln-Oseretsky Motor Development Test, 
Graham-Kendell Memory-For-Designs, Purdue Perceptual-Motor Survey, the 
WISC-R/Binet, a neurological exam by a neurologist, electro-encephalogram) used 
for diagnosing learning disabilities produced the following summation: 


There is little question that eventually the tests reviewed here will be discarded; the 


evidence against them is mounting. The central question is really whether recognition of 
the invalidity of these tests will result in abandonment of an untenable professional 
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dogma, or whether it will merely result in the test battery being replaced by other 
equally questionable instruments. (Coles, 1978, p. 335) 


In response to such a critical review, the following study was undertaken to 
determine the extent to which a commonly used test battery in a local-urban school 
setting could identify a preselected group of learning disabled children. 


Ecological Control Sample 


All of the 10 division II classes of Brentwood Elementary School, County of 
Strathcona No. 20, participated in the study. There were three each from grades 
four to six, and one full time learning disability class. This school is representative 
of schools in the County of Strathcona and many other urban communities. It is 
typically middle class, including families of blue collar and service workers and of 
some professionals. 


The initial population had 252 students in it, but 13 students either moved or 
transferred out of the school before the year end data could be collected. Three 
subjects were eliminated from the analysis due to incomplete data. Of the 236 
included, 122 were boys and 114 were girls. The total sample of 236 included the 25 
subjects classified as LD. A description of the LD sample follows. 

The study was conducted in early December to allow the teachers adequate time 
to become familiar with their students and thus be able to complete the A-M-L 
Behavior Rating Scale. Also, this is an interval in the school calendar between 
testing-reporting periods, and far enough in advance of a major holiday that 
“holiday heart” is not a contaminating factor. 


Selection Procedure and Description of the LD Sample 


This study included 10 students from the full-time learning assistance 
classroom (these students had previously received the benefit of a support service 
classroom) and 15 students currently attending a part-time resource room program 
(these students were first-time attenders in the special class). The total LD sample 
therefore comprised 25 students in grades 4 to 6 who were experiencing a two-year 
delay in one or several academic areas and who attained a full-scale IQ of 90 or 
higher using the Lorge-Thorndike Intelligence Test. 


Following the description offered by Boder (1971), the 25 students in the 
present sample were categorized as learning disabled, primarily in the major 
academic areas of reading, language, mathematics, or spelling, but where 
nonspecific factors were responsible for poor achievement. The classification of the 
sample into dysphonetics, dyseidetics, and alexics was not possible since 
achievement failures seemingly occurred without abnormal reading-spelling 
patterns: Other systems of classification such as those of Doehring and Hoshko 
(1977), Holmes and Peper (1977), Mattis, French and Rapin (1975) were not 
considered since the LD subjects in this sample demonstrated little agreement on 
the syndromes selected by these researchers for classification. Thus, this sample 
qualified for placement primarily on the basis of average or better IQ, with a 
two-year deficit in a major achievement area, reasons for this delay being classified 
as nonspecific, although some may have been classified as dysphonic or dyseidetic. 
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Procedure 


The investigation was conducted with the 236 division II elementary school 
students. The instruments composing the battery were administered in two sessions 
per classroom by the investigators during regular classroom hours. The measures 
were presented so that each of the two testing sessions took approximately 45 to 50 
minutes of classroom time. The exception to this procedure was with the grade four 
classes who were requested to spend two additional one-hour sessions completing 
the Lorge Thorndike Intelligence Test. The Lorge Thorndike is administered in 
grade five, so scores were available from this test for the grade five and six students 
from their cumulative records. It was necessary to have the grade four students 
complete this instrument to have an equivalent measure of intelligence for all 
subjects. Thus, four instruments were administered by the investigators (Wide 
Range Achievement Test, Academic Achievement Accountability [AAA] Scale, 
Student’s Perception of Ability Scale [SPAS], Beery Developmental Test of Visual 
Motor Integration [VMI]) to all division II students, and the additional IQ test to 
the grade four students. The homeroom teachers of each class were requested to 
complete an A-M-L Behavior Rating Scale on each pupil in their class. Informal 
sessions were held with teachers to explain the method used in completing the 
rating scales, and to counteract the possibility of a “halo effect.” 

The instruments which required the student to read to complete a question 
were read aloud by the examiner to the students at all grade levels so they would 
not be penalized by poor reading skills. Similarly, extended time limits were 
allowed on items requiring motor responses. 

The Canadian Test of Basic Skills was administered in May, 1978, as part of the 
school systems standardized testing program. It was administered by the classroom 
teacher, and computer scored. Results return to the schools early in June of each 
year. 


Description of Tests 


The test instruments for identifying LD students were selected on the following 
bases: 


1. four of the measures were commonly used throughout the school district 
(VMI, CTBS, L-Thorndike, and WRAT), 

2. literature on predictors of school achievement suggested that consideration be 
given to noncognitive or personality contructs (AAA, AML, SPAS scales), 

3. the instruments had to be group administered because of the importance of 
administrative time and efficiency, they must have had objective scoring 
systems, and they must have satisfied adequate reliability and validity data. 

A description of the instruments, with reliability/validity data, follows. 

Lorge Thorndike Intelligence Test. This test consists of a Verbal Battery (5 
subtests) and a Non-Verbal Battery (3 subtests) with specific items for each grade 
level. The odd-even reliability of the tests for the grade level tested (grade four) is 
as follows: r=.93 for the Verbal Battery, and r=.93 for the Non-Verbal Battery 
(Level B, N=549). 

The validity is established by correlations with other tests designed to measure 
achievement. Correlations of the verbal battery with the WISC Verbal Scale and 
the Stanford-Binet are reported “in the high seventies and low eighties.” The 
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Non-Verbal Scale correlated with the same tests in the “high sixties and low 
seventies ” (Lorge, Thorndike & Hagen, 1967, p. 29). 

Canadian Test of Basic Skills (CTBS). The CTBS is a_ standardized 
elementary achievement test comprised of eleven subtests which yield six subtotal 
scores. The split-half reliability coefficients for the grade levels tested are in the 
.88 to .98 range, with higher reliability coefficients (.95, .98) for language and 
composite scores. 

Validity figures are unavailable for the CTBS. The editors suggest that 
validation is the task of the test interpreter. Because it is impossible to design one 
achievement test satisfying nationwide objectives, the editors suggest that 
administrators decide the relationship of the test to local objects. (Hieronymus & King, 
19757 p46): 

The Wide Range Achievement Test (WRAT). The WRAT consists of three 
subtests measuring achievement in reading, spelling, and arithmetic. The reading 
subtest was not included in this study as it cannot be group administered. 
Test-retest coefficients range from .92 to .98 for the reading and spelling subtests, 
and from .85 to .92 for the arithmetic subtest. Using internal consistency for 
validation of the subtests, the intercorrelations between Spelling and Arithmetic 
subtests are .73 (age 9), .76 (age 10), and .76 (age 11). Wagner and McCloy (cited 
in Jastak & Jastak, 1978) compared the WRAT reading subtest with the 
Woody-Sangren Silent Reading Test and the New Stanford Reading Test and 
arrived at coefficients of .78 and .80 respectively. 


The Beery Developmental Test of Visual-Motor Integration. The VMI is a 
form copying test measuring sensory perception and motor skills with 24 designs 
arranged sequentially by level of difficulty. Dyckman and Rentfrow (1971) report 
test-retest correlations on children of second, fourth and sixth grade of .62 to .84 
(Dyckman & Rentfrow, 1971). 


A-M-L Behavior Rating Scale. The A-M-L Rating Scale is an 11-item teacher 
rating scale designed to identify children with learning and/or behaviour problems. 
The teacher records student behaviour in three areas (aggression, moodiness, and 
learning difficulties) on a five-point scale. In a study of children from 
multi-problem families, the Kuder-Richardson 21 reliability formula showed an r 
of .74 for one scale and .92 for the other. This scale was included in the study to 
determine if it was beneficial in improving early identification procedures (Clifford 
& Cleary, 1972). 


Academic Achievement Accountability (AAA) Scale. The AAA is an 
18-question student completed questionnaire measuring locus of control regarding 
academic outcomes. The authors report a median Kuder-Richardson formula 20 of 
.74. The correlations between Composite Grade Equivalency on the Iowa Tests of 
Basic Skills and the AAA are in the .30’s and significant beyond the .05 level. 
Means and standard deviations are relatively stable across grades, averaging 75.0 
and 9.0 respectively. A high score indicates acceptance of responsibility for 
academic outcomes. The score range is 18-90 (Clifford & Cleary, 1972). 

Students’ Perception of Ability Scale (SPAS). The SPAS measures academic 
self-concept in elementary school children. Estimates of internal consistency using 
Cronbach’s alpha yielded a full scale alpha of .92. The test-retest reliability over a 
four- to six-week period was .83. The authors report that the ability of the SPAS 


to identify students with learning problems suggests good experimental validity for 
the SPAS. 
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The SPAS subscales measure academic self-confidence in areas of (1) general 
ability, (2) arithmetic, (3) reading and spelling, (4) school satisfaction, (5) 
penmanship and neatness, and (6) confidence. The seventh scale is an index of 
“general academic self-confidence” (Boersma, Chapman & Maguire, 1979). 


In total, 22 variables were obtained, including one IQ measure (Lorge 
Thorndike), six achievement measures from the CTBS, two achievement measures 
from the WRAT, four personality measures (AML scale), seven self-confidence 
measures (SPAS), one locus of control measure (AAA scale) and one visual-motor 
integration score (SMI scale). For a list of these variables, see Table 1. 

Data Analysis 


The data were analyzed using a stepwise multiple-regression analysis followed 
by the more stringent Hotelling’s T? analysis. The stepwise regression analysis was 
conducted using group membership (control “normals” or learning disabled) as the 
criterion. The Hotelling T? was conducted with 25 students randomly selected 
from the control group and compared with the learning disabled group. Each 
variable was examined for differences between the two groups. 


Results 


Table 1 presents a summary of the results for the predictor variables: means 
and standard deviations for the learning disabled group, the control group, and 
the total group taken collectively. As can be seen from Table 1 the differences 
between means (without regard to statistical significance) are in agreement with 
general predictions. On many of the variables, a high score suggests academic 
success; it is expected that members of the control group would score better. On 
other measures, a high score indicates some learning difficulties; the learning 
disabled students tend to score higher. Scores on variables such as achievement 
(Wide Range Achievement Test spelling, Canadian Tests of Basic Skills total), 
locus of control (Academic Achievement Accountability Scale), intelligence (Lorge 
Thorndike), and self-concept (Student’s Perception of Ability Scale) do appear 
higher for the control group. Similarly, the A-M-L scores, where a high score 
indicates teacher concern with pupil behaviour, differ with the learning disabled 
group scoring higher. An interesting exception, however, is on the Student’s 
Perception of Ability Scale, confidence subscale where the learning disabled group 
scores are larger. This contradicts the expectations that learning disabled children 
would demonstrate less academic confidence (Black, 1974). 

Given that a learning disabled child is defined as having average intelligence or 
better, one would expect intelligence scores to be very similar. While both groups 
are within the average range, the control group scored 11 points higher. Possibly 
this is due to the nature of the Lorge Thorndike Intelligence Test which is a 
paper and pencil instrument. The verbal battery requires the examinee to'read the 
items and learning disabled children may be penalized by this approach. 


Multiple Regression Analyses 

The purpose of multiple regression in this investigation was to determine 
which linear combination of scores are the best predictors of group membership, 
specifically, which psychoeducational tests or which combination of them are most 
effective in discriminating between the learning disabled and the normal students. 
Table 2 presents a summary of the predictor variables using group membership in 
either the control group or the learning disabled group as the criterion variable. 
Twenty variables were entered into the regression equation. 
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TABLE 1 


MEANS AND STANDARD DEVIATIONS ON ALL VARIABLES 


Learning Disabled Control Group Total Group 
(n=25) (n=211) (n=236) 
Variable eV eae = = 
X s.d Xx s.d. X s.d 

IQ 97.24 iodsl 108.39 WGA) 107.21 12.38 
VMI 16.68 79 Xs) ih ff tS 2a73 W7e13.0 Zea 
A of A-M-L 7096 Spo4 8.02 6) 45) S02 3329 
M of A-M-L 8.64 Bie0 7.93 3.102 8.00 Bi 2 
L of A-M-L 3.60 O75 2.04 0.97 De PaAl 1.06 
A-M-L Total 20.20 6.79 LAO 5.99 18.20 6.23 
AAA (locus of control) 65n 32 LL IG) 72.89 8.68 72.08 9.43 
WRAT Spelling 40.16 6.14 FE hil 51459 46.61 6.08 
WRAT Arithmetic 31.04 Sel 33.69 3:26 Seiocal B28 
CTBS Vocabulary 2M Se 8. 36 DOS 6.87 bf SAG, Tee 
CTBS Reading Comp. Z0n52 OS 42.63 226 LOSS 2 ee 
CTBS Language 56.84 155.90) 90.50 Zara S 86.94 24.84 
CTBS Work Study Skills 46.08 14.98 64.38 18.54 62.44 19.09 
CIBS Mathematics 26.08 8.85 40.24 V7 38.74 Lys 8X0) 
CTBS Total 180.64 46.42 XO5.5 If 8 64.10 2 Orie 67.87 
SPAS General Ability 7.04 Zee Tf sloxe: Zarteye Tasoisk 2.80 
SPAS Arithmetic See 2.98 9.44 2.89 Des RS 
SPAS School Satisfaction 6.04 2.99 TOD DRS het) 2.84 
SPAS Reading/Spelling 6.88 3.06 8.5. Bolla 8.26 8 22 
SPAS Penmanship/Neatness 6.36 Sh 7.45 3)..02 Pook 3.08 
SPAS Confidence Solas lies S)s) 3) 0) sr sio) Ssh} in) 
SPAS Full Scale 37.96 OZ 43.97 14 “Seo 3 11.66 


The ‘L’ of the A-M-L Behavior Rating Scale is the most powerful measure in 
terms of predicting group membership. Teachers not only seem to be able to 
predict student academic achievement, they also appear to be effective at 
identifying learning disabled students. They rate children according to what is 
present and observable in the classroom. They know if pupils are having difficulty 
achieving, and the L scale is a reflection of their observation of actual academic 
behaviour. 


Another significant cognitive measure, the Canadian Tests of Basic Skills 
Language subscale, entered fourth in the regression analysis. Language skills do 
differentiate the two populations to a degree. Impaired language ability is one of 
the most prevalent conditions found in the learning disabled and specifically in 
our LD sample (McGrady & Olson, 1970). 


The remaining variables contributing significantly to the regression equation 
are all nonintellectual factors. An examination of the mean scores on the 
Academic Achievement Accountability Scale (a locus of control measure) shows 
that the learning disabled students are more external than the control group. The 
mean score of the Confidence subscale of the SPAS scale also discriminates 
between the two groups, but surprisingly, it is the learning disabled sample that 
scores somewhat higher. If SPAS does measure academic self-confidence, it seems 
reasonable to predict that high scores on self-confidence would be internally 
controlled. These apparent contradictions may be the result of special conditions 
in the learning assistance classes from which the learning disabled sample was 
drawn. It is possible that the confidence observed here is in fact externally 
produced. 
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TABLE 2 


MULTIPLE STEP-WISE REGRESSION 
PREDICTING GROUP MEMBERSHIPa 


Ste yh ‘ v) - 
o variable Ente mar rrctab gt hp poet p ts 
1 L of A-M-L* 60.23 =< (0) 100) 20.47 == 
2 AAA* 7.09 0.008 DPE aad Nes 35 
3 SPAS Confidence* 6.09 0.014 24.79 1.98 
4 CTBS Language* 5.88 0.016 26.66 ibe) 
5 A of A-M-L* 7.08 0.008 28.85 Tals, 
6 SPAS Penmanship/Neatness DERE 0.128 AEST 0.72 
vi WRAT Spelling Ivotesil 0.180 0,12 0.56 
8 SPAS Reading/Spelling 2.69 Ono? 30.94 0.82 
9 SPAS School Satisfaction 0.78 Os 2/7) 318 Ore: 
10 IQ 0.88 0.350 B45 De 2 
ite CTIBS Mathematics TAZ I 0.272 31.82 ORs 
iy CTBS Work Study Skills 1.19 OV2i7 B28 0.36 
1k3} CTBS Vocabulary 0.58 0.447 yan Cis) Ons 
14 CIES Total 155 On215 325.95 0.47 
15 CTBS Reading Comprehension Wee OL19L Seg eD) OF 2 
16 SPAS Full Scale 0.20 0.659 33.41 0.06 
1 A-M-L Total ORZ2 0.639 33).47 0.07 
18 M of A-M-L 6.29 0.013 S55 3h) iow 
19 WRAT Arithmetic 0.24 0.626 Spe 0.07 
20 SPAS General Ability Onn: 0.916 SGU 0.01 


Note. N=236, p=.05. 
No further steps entered beyond step No. 20 with a0.95 


20.95 


* Denotes variables significant at a=.05 


There are two additional measures which belong to this nonintellectual 
collection of predictors, the ‘A’ and the ‘M’ of the A-M-L scale. The A subscale 
measures aggression; the M, moodiness. The group of four noncognitive measures 
(locus of control, self-confidence, aggression, moodiness) contribute a combined 
variance of almost 9% to the total variance in predicting group membership. 


Academic measures, with the exception of the Canadian Tests of Basic Skills 
Language subtest, appear to contribute little towards the prediction of group 
membership. This may be partially accounted for by the predominance of the L of 
A-M-L ratings. The teacher, when completing this scale, probably considers a 
number of intellectual components in arriving at a rating. These considerations 
ultimately overshadow the results of more formal assessment. 


Teacher ratings and nonintellectual personality factors account for only 35% of 
the total variance in predicting group membership. It appears that the 
methodology employed in this investigation ignores the heterogeneous character of 
the learning disabled group. The tests used not to discriminate significantly 
between learning disabled and controls. 
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Hotelling T’ Analysis 


When comparing the difference between means of two groups on a single 
variable, one could employ a ¢t test. However, when more than one variable is 
used, a more rigourous test is the Hotelling T? statistic. This procedure compares 
the differences between all means simultaneously using the covariance among the 
variables to refine the overall relationship between groups. Results of this analysis 
are presented in Table 3. 


TABLE 3 


HOTELLING T? — COMPARISON BETWEEN 
LEARNING DISABLED AND CONTROL GROUPS 


Variable LD X. Control T2 P p level 

Wee eH, 97.24 109.04 20)..63 1.485 0.18 

2. VMI 16.68 I oS Binds) 0.248 0.99 

Big Na Odi SZ 75.40 Hil 6 55) 0.831 0.61 

4, WRAT Spelling 40.16 48.36 2), 1.668 Omi 

5. WRAT Arithmetic 31.04 Bae6 els) 0.413 0.94 

6. CTBS Vocabulary di des SV 28.88 2a 0.900 OR SS 

7. CTBS Reading Comprehension BOESZ 45.64 Dea, 12783 0.09 

8. CTBS Language 56.84 93.12 ey aie ees 3.897 <0.01 

9. CTBS Work Study Skills 46.08 66.92 A Sul WaPo 0.28 

10. CTIBS Mathematics 26.08 40.08 24.349 hn PS <0.09 
Lit ChESlo calll 180.64 274.64 38.03 25137, 0.01 
Variables 1 - 11 N = 25 Ho 3S) 5} 5)5)// <0.01 

ie IN 7.96 PANS Oval OcOls 1.00 
Alice wi 8.64 Wade 0.96 0.069 0.99 
Tos lls 3.60 LeiG Uf ihe, TS) Dia LES <0.01 
15. A-M-L Total 20.20 ITE OA Be 62 0.260 Ong 
16. SPAS General Ability 7.04 8.08 OS O33 0.99 
17. SPAS Arithmetic Bir 2 a2 Dadi] 0.199 0.99 
18. SPAS School Satisfaction 6.04 8.24 8.41 0.605 0.81 
19. SPAS Reading/Spelling 6.88 9.00 5.50 0.400 0.95 
20. SPAS Penmanship/Neatness 6.36 Te Q2 B21636 0.262 0.99 
21. SPAS Confidence Boe Br eOi2 1.003 0.072 0.99 
22. SPAS Full Scalle Shi) HE, 46.88 7.668 ORD D2 0.85 
Variables 12 - 22 N = 25 60.967 4.388 <0.01 


On the basis of the 22 variables included in this investigation, considered 
simultaneously, the Hotelling T? indicates a significant overall difference between 
the two groups (p<.01). Individual comparisons on the 22 variables show some 
group differences(p<.01). The discriminating measures include the L of the 
A-M-L scale, the Canadian Tests of Basic Skills Language subtest, and the 
Canadian Tests of Basic skills Total Score. Other differences, significant at the 
px.10 level, are CTBS reading comprehension and CTBS mathematics subtests. 
The five variables just mentioned are cognitive in nature and achievement 
oriented. Less significant differences (p<.10 to .30) are those found on the Wide 
Range Achievement Test Spelling, Intelligence (Lorge Thorndike), and CTBS 
Work Study Skills. The remaining variables, VMI, AAA (locus of control), WRAT 
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Arithmetic, CTBS Vocabulary, A and M of the A-M-L scale as well as A-M-L 
Total, and the SPAS, do not appear to discriminate between the two groups to a 
significant degree. No major differences are apparent on any of the personality 
measures. 

In reviewing all of the analyses, the L subscale of the A-M-L is consistently an 
important measure. In the regression analyses cognitive variables appear to 
account for the greatest amount of the variance, but affective variables do enter 
and contribute. This is not the case, however, when examining mean differences 
on all the variables between the two groups. The Hotelling T? analysis reveals that 
the variables showing statistically significant discrimination tend to be 
achievement oriented. The CTBS Language and the CTBS Reading focus on 
special areas of difficulty for learning disabled children. 


The definition of a learning disabled child used in the County of Strathcona 
includes an achievement discrepancy clause. These test results clearly show this 
discrepancy and confirm this part of the definition. Nevertheless, the fact that 
clear differences exist between the learning disabled group and the controls must 
be reconciled with the fact that prediction of group membership from the multiple 
regression analyses was less than ideal. 


Discussion and Implications 


The major conclusion to be drawn from this study is that this battery of 
psychoeducational measures is somewhat of a failure in predicting group 
membership in the learning disability category. Multiple regression analysis does 
discriminate between the two groups on the basis of achievement, but not on the 
basis of predicting group membership. This suggests a number of comments. 
Accounting for only 35% of the variance after introducing 20 test variables into 
the regression equation seems to indicate that they are not powerful predictors. 
The tests employed here have a low discriminative ability. From examination of 
the results of these tests, it is difficult to determine the qualifications for 
placement in a learning disability class. The single most important predictor, L of 
the A-M-L Behavior Rating Scale, singly accounts for 20% of the variance. While 
teachers are good at predicting academic achievement, they are less effective at 
predicting personality and psychological variables. 

Accounting for only 35% of the variance of group membership and predicting 
achievement much more effectively than learning disability are reasons to assume 
that some unidentified factors are involved in this study. 

One major drawback involves the global nature of the measures used. Several 
of them, such as the intelligence score, the WRAT scores, and the VMI yield a 
gross composite score. That is, the instruments are not composed of specific 
subtests sensitive to individual developmental differences in the subjects. General 
measures of achievement, intelligence, and motor skills may be acceptable as 
initial screening device, but may be ineffective predictors in individual cases. 

Another problem is the manner in which the scores were treated for analysis. 
The procedures ignored the possible variation within each child’s profile of scores 
on the several variables. In the learning disabled group, it may be that it is the 
relationship between variables that accounts for their learning problems. To use a 
group score on each variable or to use all variables with equal emphasis for each 
child thus increases the likelihood of not seeing profile characteristics which may 
very well be significant determinants of their disability. 
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The characteristics of underachieving children vary greatly, and numerous 
terms have been employed to describe their symptoms, behaviours, psychological 
nature, and physical traits (Wallace & McLaughlin, 1975; Baren, Liebel & Smith, 
1978). The heterogeneity of the learning disabled population necessitates an 
individual analysis of each learning problem. It does not appear that one can 
identify, through selected psychoeducational tests, a single or group of test-related 
constructs that will effectively identify these children. Given this difficulty, 
perhaps Coles’ (1978) criticism of the tests currently used in the identification is 
warranted. Most of the tests used in “learning-disability identification” were not 
initially constructed for that purpose. Use of a battery to identify a learning 
disability will, therefore, lead the school psychologist to make judgments based on 
invalid criteria. The answer to such a dilemma would not be that other tests be 
selected or developed, but rather that an ecological or “total” assessment model be 
followed that considers the interaction between all relevant factors in isolation 
(Hardin, 1978). 


This research project was supported by a University of Alberta Research Grant #55-32856. 
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Cross-Validation of Das’ Simultaneous 
and Successive Processes with Novel Tasks 


The successive and simultaneous factors identified for numerous samples by 
Das and his colleagues were also isolated for the present sample of 77 normal 
children in grades K-6. The emergence of these dual modes of processing 
information using a test battery composed of several novel tasks provides 
evidence of the generalizability of the successive and simultaneous syntheses 
that Das has investigated in his applications of Luria’s theory. 


Das, Kirby, and Jarman (1975, 1979) present a considerable amount of 
research evidence in support of Luria’s model of cognitive functioning, particularly 
the aspect involved with simultaneous and successive processing. Simultaneous 
processing involves stimuli which typically have spatial components and require 
multiple processing, and is linked by Luria to the occipito-parietal areas of the 
cortex. This type of processing is necessary to form any holistic gestalt and has 
been associated with tasks such as Raven’s Progressive Matrices, Memory for 
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Designs, and Design Copying. Simultaneous processing is likely to be involved 
when a task requires the organization of stimuli with consideration of the 
relationships between each of several components, regardless of the specific 
content of the task. In contrast, successive processing involves the integration of 
stimuli into some temporally organized series. That is to say, each stimulus is 
related only to the preceding and following ones in a linear fashion. Serial recall of 
words, free recall of words, and WISC-R Digits Forward have been the tasks most 
frequently used to identify successive processing factors. According to Luria 
successive processing is linked to the fronto-temporal areas of the cortex. 


With few exceptions, Das, Kirby, and Jarman have employed the same set of 
tasks to isolate simultaneous and successive processes in factor-analytic 
investigations of children. In addition, Das, his colleagues, and his students have 
been the predominant researchers in this area of study. One attempt at 
cross-validation by independent investigators with a sample of college students 
(Vernon, Ryba & Lang, 1978), using tasks similar to those developed by Das, 
produced apparently negative findings. However, college students represent a 
group that is quite different from the groups of school-age children studied by Das 
and his colleagues; in addition, the interpretations given the data by Vernon et al. 
(1978) concerning the lack of support for the Das model seem open to challenge 
(Das, Cummins, Kirby & Jarman, 1979) 

It is evident that replications of Das’ simultaneous and successive factors are 
needed by independent groups of researchers using different types of tasks from 
the ones used by Das and his colleagues. Such studies should begin by 
cross-validating the Das model with school-age children, rather than with samples 
such as college students which have not been systematically investigated by Das et 
al. (1975, 1979). The present study represents one such attempt at cross-validation 
with a school-age population. 


Method 


Sample 

The sample was comprised of 77 children (39 boys, 38 girls) in grades K 
through 6 who attended elementary school in Flagstaff, Arizona. The school is 
conducted by Northern Arizona University, in which it is located, and contains 
many children of students, faculty, and staff, as well as a substantial percentage of 
children from non-university families. The children were spread fairly evenly 
across the grade range, with 10-14 selected randomly from each grade level. They 
ranged in age from 52 to 150 months (mean = 99.5 months, SD = 25.6 months) 
and the racial breakdown was as follows: 94% White, 4% Black, 1% American 
Indian, 1% Asian. Otis-Lennon IQs were available for 43 of the children in grades 
2-6. They earned a mean of 113.8 (SD = 14.0), suggesting that the sample as a 
whole is above average in intelligence. 


Instruments 

A battery of 11 tasks, six believed to measure simultaneous processing and five 
hypothesized to assess successive processing, was used. Of the existing tasks, 
WISC-R Digits Forward (Wechsier, 1974) served as the marker test for successive 
processing and Raven’s Progressive Matrices (Raven, 1956, 1960) was the marker 
for simultaneous processing. There is a wealth of evidence to show that these two 
tests consistently load as predicted for a variety of populations (Das et al., 1975, 
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1979). Whereas Das and his colleagues administer the Coloured Progressive 
Matrices, we administered the Coloured Matrices (Levels, A, AB, B) along with 
Levels C and D of the Standard Matrices to ensure sufficient difficulty for the 
children in the intermediate grades. (Maximum score on Raven’s Matrices was 
equal to 60 points.) The other two existing tasks were WISC-R Block Design and 
Digits Backward. Block Design was hypothesized to be a simultaneous task, and 
Digits Backward successive. However, the latter hypothesis was tentative since 
there is reason to believe that reversing digits differs considerably from repeating 
digits forward (Costa, 1975; Jensen & Figueroa, 1975). 


The seven new tasks were all constructed as part of the test research and 
development endeavour leading to the Kaufman Assessment Battery for Children 
(K-ABC; Kaufman & Kaufman, in press). These tasks (some of which are included 
in revised form in the K-ABC) are described below; the first four were designed to 
assess simultaneous processing, and the last three were developed to measure 
successive processing. 

Concept Formation. This 47-item subtest is patterned after Bruner’s 
experimental research. The concept formation tasks present to the child six 
pictures (or designs), three of which are positive examples of a concept and three 
which are not. For the first two items, the child is told by the examiner what the 
unifying concept is (e.g. “. . . we are going to look for things to eat”). The child is 
shown the members in the set by the examiner placing a large cutout square over 
each picture and identifying each example of the concept by saying “This is one”; 
for the distractors the examiner simply says “This is not.” After exposure to these 
stimuli, the child is shown an array of new drawings and asked “Is this one?” (an 
example of the concept) for each of the five drawings in turn. After the first two 
items, the name of the concept is not given by the examiner who simply identifies 
which stimuli are and are not examples of the concept. The child must draw the 
conceptual implications by him/herself. The visual stimuli are composed of 
approximately one-quarter abstract shapes and designs and _ three-quarters 
concrete, meaningful pictures. 

Memory for Places. For each of 32 items, the examiner shows the child a page 
containing from one to seven child-oriented drawings of familiar things arranged 
in varying positions on the page. After an exposure time corresponding to one 
second per pictured object, the page is removed. Then a card divided into a grid 
of rectangular blank boxes is presented and the child is asked to point to the 
locations of the objects that were just exposed on the stimulus page. The first 15 
items use a 3 x 3 grid for responses, while items 16 through 25 increase the 
difficulty by using a 3 x 4 grid. Items 26 through 32 expand the grid further to a 4 
x 4 division of the response page. 


3x3 3x4 4x4 
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Triangles. A task of spatial reasoning and visual motor coordination, Triangles 
requires the child to manipulate from two to nine plastic triangular shaped pieces 
(all red on one side and white on the other) to copy designs presented on a set of 
38 item cards. The models to be matched remain in the child’s view throughout, 
and there are no time restrictions. Most of the designs to be copied are themselves 
in the shape of triangles, only a few introductory items are not. The child is 
always given the correct number of triangles needed to complete the construction. 
The first four items require only two triangles, and a gradual increase in the 
number of triangles needed increases the difficulty. Thus, the three-triangle 
designs proceed to four-triangle ones with guide lines and extend to four-triangle 
models with no guide lines. Items 15-26 and 27-38 contain eight and nine triangles 
respectively. This test is intended to be a variant of Wechsler’s Block Design. 


Overlapping Designs. Constructed along the stencil design motif, in this 
subtest the child is given a set of eight irregular geometric shapes. Two pieces are 
yellow, two are white, two are blue, and two are black. The examiner presents 25 
hexagon shaped designs made up of differing amounts of these four colours. The 
child must then place the pieces one on top of the other, using as many as 
necessary to duplicate the given design. Since there are no time limits and the 
model is exposed while the child is constructing the design, memory is not a factor 
for successful performance. The child is required to match the model in both 
hexagon shape and overall size. 


Hand Patterning. This task requires the examiner to produce a series of 
movements using only one hand. A repetitive pattern is composed by a 
combination of gentle taps on the table using either fist, palm, or side of the hand. 
Early items begin with one, then two, such movements and gradually increase up 
to the most complicated of the 17 items (seven stimulus taps). The child watches 
the examiner tap out three sequences of movements at the rate of one movement 
per second (e.g., a relatively easy item was palm-fist, palm-fist). There is a 
one-second pause between each of three units comprising each item. To be 
credited, the child must recreate each unit three or more times in a row. 


Sequential Hand Movements. Here too the examiner uses a hand to gently tap 
out movements, this time using the knuckles in addition to the palm, fist, and side 
of the hand. Each of the 20 items grows increasingly longer and thus more 
complex; the number of movements in each item ranges from one to six. In 
contrast to Hand Patterning where the child is asked to repeat a rhythmic 
pattern, for Sequential Hand Movements the child must produce each movement 
in sequential order only once. Sequential memory is the key to success on this 
task, although motor coordination is also involved. Luria used a similar task in his 
research with brain damaged patients. 

Memory for Words. The stimuli on this subtest are a series of commonly 
known words which the examiner says to the child. The words are read at the rate 
of one per second, the items range from sets of two through six words. The child 
is shown a card with the silhouettes of the pictures of each word and responds by 
pointing to each silhouette in the same order as they were named by the 
examiner. The first 10 items employ a card with six pictures (e.g., leaf, star) and 
the remaining 25 items use a different card with seven silhouettes (e.g., hand, 
moon). The intermediate items use this second card in the same fashion as the 
first one, but the difficulty level is increased in the last 16 items by introducing an 
interference task in between the examiner’s verbal stimuli and the child’s 
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nonverbal response. This task requires the child to name colours presented on a 
card as rapidly as possible for five or ultimately 10 seconds. With the exception of 
the nonverbal mode of response, this task resembles one developed by Luria and 
used by Das et al. (1975, 1979). 


Procedure 

The 11 tasks were administered to the children by examiners who were trained 
in individual assessment and certified as psychometrists. Due to practical 
constraints, all subjects were not given all measures. All 77 children were 
administered Digits Forward and Digits Backward; 76 youngsters were given Block 
Design, Triangles, Overlapping Designs, Memory for Places, and Memory for 
Words; 72 to 75 were given Concept Formation, Raven’s Matrices, and Sequential 
Hand Movements; and 67 were administered Hand Patterning. 

Raw scores on the 11 tasks were intercorrelated, and then a partial correlation 
matrix was obtained, removing the influence of chronological age. This 11 x 11 
partial correlation matrix was subjected to principal components and principal 
factor analyses. For the principal factor analyses, squared multiple correlations 
were placed in the diagonals as the initial estimates of communalities. Iteration 
was performed to obtain better estimates of the communalities, and then varimax 
rotation was applied. The two-, three-, and four-factor solutions were examined to 
determine which one made the most psychological sense. 


Results and Discussion 


The parital correlation, along with means and SDs for the 11 variables, are 
shown in Table 1. The two-factor varimax rotated solution was selected for 
interpretation, and is presented in Table 2. In the three-factor solution, the 
Simultaneous factor split in two, with Raven’s Matrices and Overlapping Designs 
having substantial loadings on both the first and third factors. In addition, the 
third factor was quite small, having an eigenvalue below 0.50 prior to rotation. 
The four-factor solution produced two trivial factors and was clearly 
uninterpretable. 


TABLE 1 


INTERCORRELATIONS AMONG ELEVEN COGNITIVE TASKS, 
WITH AGE PARTIALLED, OUT FOR CHILDREN IN GRADES K-6 


Task (2) (3) (4) 


GGG) 7a C8) 9) en (LO) Seas) Xx sD N 

1. Raven's Matrices 53 40 58 46 32 als) 24 22 31 41 Nese?! 13.3 73 

2. WISC-R Block Design 54 53 32 42 38 29 06 Be 315) Zewl ey 76 

3. Triangles 43 25 35 17 24 IPS} 10 ial 13.8 8.6 76 

4. Overlapping Designs 41 36 26 33 31 28 29 thee” S10 76 

5. Concept Formation 30 09 3 09 19 20 30.8 IW 2 12 

6. Memory for Places 23 17 26 23 26 L6y-7 6.6 76 

7. WISC-R Digits Forward 28 31 29 25 6nd AS) This 

8. WISC-R Digits Backward 19 31 22 3.6 1.9 ri 

9. Memory for Words 34 13 11 33 4.3 76 

10. Sequential Hand Movements 44 Liss PES) 75 
ll. Hand Patterning 7.8 Siett 67 


te { 7 1 — biel 
Note. Decimal points are omitted from the coefficients of correlation. 
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The two-factor solution shown in Table 2 reflects clear-cut Simultaneous and 
Successive factors. The six subtests hypothesized to require simultaneous 
processing had loadings of .45 to .70 on the Simultaneous dimension, and failed to 
load above .32 on the Successive factor. Similarly, the five tasks believed to 
measure successive processing loaded .38 to .73 on the Successive factor, but had 
inconsequential loadings (.25 and below) on the Simultaneous factor. 


TABLE 2 


VARIMAX ROTATED FACTOR MATRIX FOR CHILDREN 
IN GRADES K-6, WITH AGE PARTIALLED OUT 


Task Unrotated Rotated Factors 
Task Peticws lai) ( seaitieaneouagMEUECeeRiveMM Mach oo kin 
Factor il wait p 

Raven's Matrices pl 66%* 30 53 
WISC-R Block Design 74 70* 30 58 
Triangles 55) 64% 07 41 
Overlapping Design 73 67%* 32 56 
Concept Formation 47 48%* 14 25 
Memory for Places 5S} 45% 28 28 
WISC-R Digits Forward 42 18 46%* 24 
WISC-R Digits Backward 43 5} 38% aA 
Memory for Words 36 it 44x 20 
Sequential Hand Movements 5S, tbl 73% 54 
Hand Patterning 49 25 48% 30 


Note. Rotated factor loadings of .35 and above are asterisked. 
Decimal points are omitted. 


The emergence of Memory for Places as a Simultaneous task affirms that 
Factors I and II are not merely “reasoning” and “memory” dimensions, as 
postulated by Jensen’s ability hierarchy theory, but reflect instead the 
simultaneous and successive syntheses that Das et al. (1975, 1979) propose as an 
alternative to the ability approach. Digits Backward was the only task that did 
not load decisively in the predicted direction, producing a .38 Successive loading 
vs. a .25 Simultaneous loading. Perhaps some simultaneous processing is 
demanded for success in repeating digits in a reversed order, as suggested by data 
on brain damaged patients (Costa, 1975; Weinberg, Diller, Gerstman & Schulman, 
1972). 

The results of our study give strong support to the existence of successive and 
simultaneous processing dimensions in normal school-age children, and can be 
interpreted as an independent cross-validation of the model proposed by Das and 
his colleagues. The breadth of the simultaneous dimension has been extended by 
the present data. Das et al. (1975, 1979) have typically utilized simultaneous tasks 
with abstract content such as matrices and design copying, but we have found the 
subtests with meaningful content (Concept Formation, Memory for Places) also 
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load substantially on the Simultaneous factor. The skill involved in Concept 
Formation, categorical thinking, further expands our understanding of the nature 
of simultaneous syntheses. 

The data in Table 2 regarding the unrotated general factor may be interpreted 
as loadings on the so-called g factor. As one might predict, Raven’s Matrices and 
WISC-R Block Design were among the best measures of g, being joined by the 
new Overlapping Designs task. Whereas the very best measures of g were on the 
Simultaneous factor, one cannot conclude that successive processing tasks, as a 
rule, are poor correlates of the g factor. In our study, two Successive tasks 
(Sequential Hand Movements and Hand Patterning) had g loadings that were 
comparable in magnitude to the unrotated first factor loadings of three 
Simultaneous subtests: Triangles, Memory for Places, and Concept Formation. 
Thus, although some researchers (e.g., Vernon et al., 1978) have maintained that 
simultaneous processing is higher than successive processing in the intellectual 
hierarchy, Das may be correct in arguing to the contrary. Das and his colleagues 
insist that the short-term memory tasks they have used to define the successive 
factor reflect a very limited sampling of a considerably broader class of behaviour 
(Das, Kirby & Jarman, 1979). When they have expanded successive processing to 
include various language tasks, greater equity between the successive and 
simultaneous dimensions has resulted (Cummins & Das, 1978; Jarman, 1978). 


Recently, we have identified Successive and Simultaneous factors in the 
WISC-R by factor analyzing data for normal, retarded, and referral populations, 
after first eliminating the Verbal subtests that are heavily dependent on school 
achievement (Naglieri, Kamphaus & Kaufman, in press). As was stated in the 
conclusions of that study, we have provided further empirical support of the 
existence of two styles of information processing, but there can be no inferences 
about the correctness of the brain localizations postulated by Luria for successive 
(fronto-temporal) and simultaneous (occipital-parietal) syntheses. In the present 
study, a blend of old and new tests has yielded clear-cut Successive and 
Simultaneous factors. However, whether these dimensions are related to 
front-back brain functioning (Luria, 1966), left-right hemisphere processing 
(Bogen, 1969), or some other neurological model remains an unanswered question 
that would be a fruitful topic for future neuropsychological research. 


The authors wish to recognize the invaluable assistance of Northern Arizona University Eastburn Educational Research Center 
for their cooperation in this investigation. 
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PERSPECTIVES 


Becoming a Teacher: 
Some Historical Perspectives 


John Calam 


The University of British Columbia 


The Whig historian, says Herbert Butterfield, “very quickly busies himself with 
dividing the world into the friends and enemies of progress” (1931, p. 5). Historical 
research serves to correct this rendition in detail. The tendency, however, has been 
“to patch the new research into the old story even when the research in detail has 
altered the bearing of the whole subject” (p. 6). 


Educational historians have inherited several old stories. One is that from the 
time of Confucius to that of John Dewey, there extends a long line of great teachers 
whose inspirational qualities justify great-person educational history as a proper 
study for preservice teachers. A second intimates how wise those forebears who 
devised forms of public schooling amenable to steady improvement — the roots, 
indeed, of present perfection. A third declares how these public schools enabled 
children of humble origin to get on in life, to be socially mobile. Enduring examples 
are Three Thousand Years of Educational Wisdom (Ulich, 1947); Public Education 
in the United States (Cubberley, 1919); and The Development of Education in 
Canada (Phillips, 1957) respectively. Researchers nowadays take such works to 
task. Some question historical explanations which merely invoke the “influence” of 
past heroes and heroines. Others say there are educators of days gone by who would 
have considered such ideas as public schooling insane and treasonous.’ Yet others 
speak of school systems shaped not to help children get ahead but rather to give 
them a future “roughly congruent with the status of their parents” (Katz, 1971, p. 
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xviii). The genre includes The School Promoters (Prentice, 1977); Education in the 
Forming of American Society (Bailyn, 1960); and Class, Bureaucracy, and Schools 
(Katz, 1971). Each is a nagging reminder of how slow we are “revaluing the whole 
and reorganizing the broad outlines of the theme in the light of . . . [new] 
discoveries” (Butterfield, p. 5). 


If, then, historical research is intended to set the old story straight through 
more judicious counterpoise of breadth and depth, how have these balancing acts 
affected educational history? It can be said that the teaching of this subject to 
prospective educators these days draws on a wide variety of corrective historical 
enquiry. Attention to this research, moreover, selectively modifies the old story well 
beyond course descriptions that once boasted of an historical account from the time 
of the Ancient Greeks to the present. For those who write either texts or 
interpretations of extensive coverage and wide appeal, however, the task has proven 
less manageable. For example, editors of the text Canadian Education: A History 
(Wilson, Stamp & Audet, 1970) offered 21 chapters shared among nine authors and 
designed to correct the old story according to Bailyn’s precept that education is not 
just formal pedagogy but an entire process. This is not to say that these scholars 
merely patched the new research into the old story. It is simply to emphasize the 
difficulty encountered altering the bearing of the whole subject through complex 
editorial orchestration. Sweeping interpretations such as American Education: The 
Colonial Experience (Cremin, 1970), American Education: The National 
Experience (Cremin, 1980), and American Education: The Metropolitan 
Experience (Cremin, forthcoming) give rise to yet more severe correctional 
problems. One might expect that single authorship and major publication wou!d 
together produce the best combination imaginable to correct the old story. To be 
sure, Cremin mapped out the territory for a monumental work in light of revisionist 
corrections typified by Bailyn and others in the early 1960’s. Carl Bridenbaugh, for 
one, recognized its “resourceful and prodigious research, breadth of view, balanced 
judgment, effective organization, and . . . richly rewarding synthesis”.? So deep the 
commitment, so necessarily protracted the writing period, though, the vast 
scholarship thus generated presents a spectacular target for research challenges 
issued throughout its painstaking development. Hence Clarence J. Karier’s 
contention that despite Cremin’s wide focus on what constitutes education, his 
liberal theory precludes violence, exploitation, genocide and so forth “being treated 
as intrinsic elements of American history” (p. 459). The result, Karier charges, is 
new educational history based on an old theoretical model resulting in misleading 
optimism about America’s educational past. 

As part of this historiographic ecology where breadth vies with depth, Canadian 
writers of thematic provincial histories of education take up their pens at some risk. 
On the one hand, current issues such as ethnic minorities or regional separation 
prompt provincial researchers to explore topics with a view to explaining their local 
peculiarities. These scholarly adventures, well exemplified by Shaping the Schools 
of the Canadian West (Jones, Sheehan & Stamp, 1979) and Schooling and Society 
in Twentieth Century British Columbia (Wilson & Jones, 1980), are especially 
useful for teaching purposes. The best of them probe such hypotheses as that 
public schooling was the instrument of social conditioning for an urban industrial 
order. On the other hand, since Phillips’ The Development of Education in Canada 
(1957) and A Brief History of Canadian Education (Johnson, 1968), no Canadian 
scholar has managed in a single interpretive educational history to alter the bearing 
of the whole subject. The result is an ever-lengthening shelf of corrective 
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anthologies, monographs, and articles cumulatively critical of the same old stories. 
The provincial researcher might well wonder whether this emendation serves any 
purpose save perpetual correction in detail or whether it might some day prompt 
comprehensive synthesis in the bold style of a Cremin. 


Lifting particular research beyond mere parochial concerns requires accepting 
the latter proposition. It rests on the assumption that testing general hypotheses by 
reference to provincial data can enrich the fund of resources, in ascending order of 
an analytical refinement, from which persuasive new Canadian educational histories 
are likely to be refashioned, though not, admittedly, in the absence of new 
conceptual frameworks. The observations to follow are based on this premise. They 
arise from the impression that an historiographic lineage founded by MacLaurin 
(1936) has resulted in continued preoccupation with who was British Columbia’s 
first teacher.* They suggest that this distraction obscures any consideration of what 
teaching might have meant to pre-contact Pacific Northwest Indian society. They 
attempt to establish something of the range of educational background 
characterizing several teacher prototypes caught up in the business, administrative, 
governmental, and social beliefs of their colonial new world milieu. Finally, they 
preface further research in provincial teacher education. The purpose is 
re-examination of yet another old story about the evolution of Normal Schools and 
Faculties — a story which in the light of current charges of irrelevance, boredom, 
triviality, and simplemindedness, deserves attention.‘ 


I 


Teaching in what we now call British Columbia did not begin with Hudson’s 
Bay Company initiatives. It began, if one ever begins anything so subtle as 
teaching, in pre-contact Indian society. As Robin Fisher (1977) warns, it is 
dangerous to make “generalizations about the Indians of B.C. as if they were 
culturally similar” (p. xiv). But throughout the region, teaching seems to have 
involved all reputable elders in fairly close agreement on what knowledge was 
most worth knowing. There were skills necessary for physical survival — flaking 
points, spearing salmon, and so forth.® Then there were attitudes toward the way 
society got along — respect for age; acceptance of rank; that sort of thing. Finally 
there were spiritual beliefs that helped people conceive of their relationship with 
the non-human world, realized through myths which explained origins and 
life-cycles and upheavals; thanksgiving ceremonies; healing methods; dances; secret 
societies. 


Because of the closeness of survival skills and social conventions to daily 
existence, the teaching role must have been in many instances undifferentiated. 
Reconstructing the days of old, Kwakiutl Chief Charles James Nowell relates how 
as a boy he lay beside his father at night and listened to stories of ancient 
potlatches. Kwakiutl Chief James Sewid left similar accounts of how, around the 
fire, the old people would “pound it into my head to know my relatives” 
(Spradley, 1969, p. 31).° Jenness described particular teaching acts among the 
Carrier. Before Tsimshian contact, “the boy followed his father and uncles to the 
chase, the girl helped her mother and aunts in all the domestic duties of camp 
life” (Jenness, 1929, p. 96). After cultural overlap with Tsimshian ways, Carrier 
parents and grandparents taught little children. When these grew older, “the 
mother’s brothers and sisters played an important role” (Jenness, 1929, p. 99) 
teaching skills and customs, gradually tightening supervision to ensure correct 
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performance and understanding. It would be absurd at this point to speak of 
teacher education, a term no _pre-colonial Indian could possibly have 
comprehended. In general, though, becoming a teacher required mastery of a 
particular piece of tribal or clan existence and mastery as well of the gamut of 
assumptions surrounding it. To illustrate, a Lower Lillooet Indian father teaching 
his son salmon fishing would first need to know not only the technology of weir 
construction but also the nature and purpose of the salmon ceremony which 
included prayer and complex rules governing ritual consumption without which 
future runs might be expected to fail. 


Teaching, however, could constitute a more self-conscious act too, being clearly 
a matter of social delegation or political appropriation. At potlatches, for instance, 
chiefs reinforced the present status of the rank-ordering system.’ On other 
occasions seasoned members of secret dance societies taught specific skills and 
subjected novices to stringent initiatory ceremonies. By comparison, public 
orators, appointed by chiefs, first mastered the language, its myths, its imagery, its 
powerful delivery, preliminary to oratorical performances upon ceremonial or 
negotiating occasions. 


Teaching carving or the mysteries of the shaman also called for wide cultural 
as well as technical preparation. A Tsimshian artist born about 1875, Brian Peel 
apprenticed as a child to Squadin, Aiyansh Chief and a noted wood carver. As a 
youngster, Peel had to eat specially-prepared food in a prescribed way, watch 
other carvers at work, and practise himself with a small adze and knife. Later, he 
worked with Squadin’s gifted associates, specializing at length in ceremonial masks 
with movable parts. By 1910, like antecedents who never saw white men, he 
became a master carver ready, in turn, to reveal the craft to reliable beginners. 
Instructing a future shaman in readiness for a teaching role intended to reinforce 
beliefs about body, soul, and spirit was just as exacting. The Upper Stalo 
preferred induction from childhood. Interested children of good character learned 
from shaman parents or grandparents or from an older unrelated shaman retained 
by parents for the purpose. They were told that solitary wilderness quests would 
result in encounters with guardian spirits who “instructed the individual, then .. . 
departed” (Duff, 1952, p. 99). They received help from established shamans in 
cultivating clairvoyance, sleight of hand, ventriloquism, and perhaps hypnotism, at 
age 30 or so finally teaching their people of a spiritual order of things permeating 
the daily round. 

Diverse as parents, relatives, elders, chiefs, dancers, orators, carvers, or 
shamans might have been in their various teaching functions, training for these 
tasks included common elements. These were observation of the skill in practice; 
trial and error participation with no stigma attached to honest stumbling; and 
spiritual instruction to afford the skills in question the reverence they deserved 
and to place them in the broadest possible human context. 


II 


In sharp variance to these deeply contextual ways of becoming a teacher are 
patterns associated with postcontact British Columbia and reflecting old world 
assumptions. Though different one from another, they overlap in one principal 
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respect. Each holds teaching to be a specific function carried out in a school by a 
legitimized agent — a teacher who can be formed for the purpose. Otherwise, 
approaches presume special priorities according to their cultural points of 
departure. A note on each of three 19th century British Columbia teachers’ might 
serve to illustrate this last contention. 


The first such figure is the Reverend Robert John Staines whom James 
Douglas brought to Fort Victoria in 1849. Staines was born in Oundle, 
Northamptonshire, where he attended grammar school. At age 19 he entered 
Cambridge. The next decade witnessed a series of creditable accomplishments — 
recommendation as vice-principal, St. Mark’s College, Chelsea; assistant master 
(classics and mathematics), Derby Grammar School; recommendation as master of 
St. Asaph’s Grammar School; tutor in Ireland to a military officer’s family; 
proprietor of his own private school at Boulogne sur Mer, France; ordination as 
Anglican priest through the good offices of influential clerics and administrators. 


Why, then, did the Hudson’s Bay Company employ as teacher this man who, 
as it turned out, proved to be one of the company’s most tiresome political 
antagonists?® In general it did so because recruiting and retaining manpower on 
the frontier improved with the promise of “normal” family existence, such 
normalcy including children’s schooling. In particular, Douglas’ expectations about 
becoming a teacher rendered Staines a prestige candidate for the job. In 1840, he 
had been admitted to St. John’s College, Cambridge, where for four years he read 
mathematics with the College’s 11th. Wrangler, F.B. Scott, M.A. Afterwards, he 
transferred to Trinity Hall, Cambridge, read mathematics and classics, and took 
his Bachelor of Arts degree in 1845. Within three years, he could boast that his 
studies fitted him to teach “Classics, Mathematics, and every branch of the usual 
routine of an English education, including drawing if required” (Slater, 1950, p. 
191). The claim underscores two beliefs about teacher education commonly held 
130 years ago. 


The first of these is that teaching is “an art, learned from experience and 
association” and dependent upon mastery of a discipline. It is not to be confused 
with methodology or strategy abstracted beyond the disciplines. Teaching thus 
construed may not be divorced from learning. In fact, learning is teacher 
preparation. The view is central to Staines’ professional testimonials. And it is 
unselfconscious. Scott of St. John’s College “certified that Staines was his pupil in 
mathematics and that he considered him fully qualified to teach the first branches 
of that science” (Slater, p. 189). Marsh of Trinity Hall “considered him a good 
classical scholar and possessed of considerable abilities, such as would enable him 
to undertake the tuition of boys preparatory for the universities” (Slater, pp. 
189-190). Layng of Oundle Grammar School related his merits as a teacher to his 
scholarship, including English literature, and referred to his “aptitude.” 


The second conviction is that since education is a moral enterprise, becoming a 
teacher presupposes religious instruction. In this light, Staines’ supporters spoke 
of him favourably. William Layng noted his deep attachment to the Church of 
England. Dean Newland of Ireland styled him “a faithful member of the Church” 
capable of laying a solid religious foundation in his pupils’ minds. Staines himself 
declared “religious instruction in accordance with the doctrines of the Church of 
England” (Slater, p. 191) were indispensable to education. As late as 1850 when he 
was entertaining second thoughts about the schoolmaster’s appointment, Governor 
Douglas conceded that Staines’ attention to religious training lay beyond 
reproach.” 
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Ill 


A contemporary of Staines, the Reverend Honoré Timothy Lempfrit O.M.L., 
typifies a second priority in the business of becoming a teacher. Born in Lixheim, 
France, Lempfrit left Oregon territory in 1848 on a mission to Vancouver Island 
Indians. Without Hudson’s Bay Company funds but initially afforded a shelter, he 
started a school for the Company’s Canadian servants, that is, for the wives and 
children of French Canadian employees. In 1851, he resumed work with the 
Cowichan Indians, at length departing for San Francisco in 1853 to rejoin his 
order." 


How did this 19th century Catholic become a teacher? In Lempfrit’s youth it 
was believed that priests “should be chosen young and educated in an intensely 
Catholic atmosphere” (Anderson, 1975, p. 110). Around the age of 12 it is probable 
that he entered a petit séminaire and in due course a seminary under episcopal 
control. Traditionally, the petit séminaire might be either an institution separate 
from others, with a classical curriculum similar to that of competing secondary 
schools, or one adjunct to the seminary. Promising boys who could read and write 
were admitted to learn grammar, plain song, belles lettres, the scriptures, 
sermon-writing, and ritual. They wore ecclesiastical habit, were tonsured, and 
subject to clerical discipline. Between the old régime and the Second Republic, 
their curriculum came more to resemble that of the collége. After the Restoration, 
certain of them adopted less-severe dress requirements and were at liberty to 
follow the subjects of the lycée, notably, Latin, Greek, and French, philosophy, 
history, modern languages, and elementary science. In short, the petit séminaire 
of Lempfrit’s day was designed to provide a classical secondary education in a 
setting conducive to spirituality and religious observance. 

Conversely, the seminary was largely devoted to final, thorough preparation for 
priestly duties, culminating in ordination. McManners describes the tradition as it 
once applied to the Grand Séminaire at Angers. Some seminarians followed 
theological lectures for a year, others for three years. But it was meditation, 
liturgy, and orthodoxy rather than scholarship that occupied most candidates. 
Barnard writes of grands séminaires established in Paris and open to young men 
of eighteen who had “received a general education elsewhere.” They exacted strict 
obedience to regulations, especially the use of Latin, and learned grammar (i.e., 
elementary classics), Latin and Greek, theology, philosophy, and the sacraments. 
Mousnier (1974) outlines the education of French Catholic clergy, observing that 
earlier seminaries emphasized plain chant, liturgy, the sacraments, and scripture. 
Over the 18th century, philosophy, theology, canon law, and Hebrew appeared on 
the course of studies. The result, some held, was a clergy “plus intellectuel, plus 
raisonneur . .. , plus conscient de ses devoirs” (Mousnier, pp. 274-275). 
Nevertheless, shortly before Lempfrit was born, Emery, the Sulpician Superior of 
the Angers Seminary, captured the spirit of a seminarian’s professional 
preparation. He is said to have remarked that the first and last goal of that 
Institute was “to live for God in Christ Jesus our Lord” (translation from 
Mousnier, p. 275). In Staines’ case, it might be argued, religious instruction was 
part of becoming and being a teacher. With Lempfrit, religion was central. For 
him, the aim of education was salvation. Who, then, could become a teacher who 
did not know along which path to find it? 
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IV 


Representing a third route to becoming a teacher is John Jessop, born in 1829 at 
Norwich, the English cathedral town in whose basillica the Reverend Robert 
Staines was ordained when Jessop was 19. After schooling in Norwich, in 1846 he 
sailed for Boston with his parents, eventually arriving in Toronto. There, he 
graduated from the Toronto Normal School, journeyed to Victoria, and after trying 
to make a go of things as prospector, political aspirant, printer, and teacher, 
secured preferment as British Columbia’s first Superintendent of Education under 
the 1872 Public Schools Act. In his day, the notion emerged that teaching may be 
learned through a fairly short period of special study and practice. This way of 
thinking, by 1860 accepted in several European countries and American states, 
characterized Egerton Ryerson’s work at the Toronto Normal School.’ Jessop and 
others familiar with the idea supported it as directly relevant to B.C. educational 
needs. As they apply to Ryerson’s institution, three aspects of the supposition are 
of interest here. 


One was that normal school training depended on an admission process that 
screened for character, “capacity, and disposition.” Initially, character was assessed 
through testimonials from clergymen. Capacity and disposition, however, were 
monitored throughout the term. Thus, anybody inattentive forfeited his free books, 
tuition, and subsistence allowance. Worse yet for triflers, incapacity, negligence, or 
misconduct led to dismissal. In fact, during the first seven years from 1847, 6% 
were required to withdraw and another 14% denied admission. Those who remained 
acknowledged stiff regulations on personal conduct — especially women who, as 
residents in a strange city, were held to be “particularly exposed to intrusions and 
dangers” (Hodgins, 1911, p. 274). 

A second feature of Ryerson’s scheme was the comprehensiveness of its subjects 
of instruction. Trainees demonstrating only passing acquaintance’ with common 
school knowledge would simultaneously learn the subjects and how to teach them. 
Their instruction was quite rudimentary, amounting to language skills, history, 
geography, mathematics, science (including agriculture), music, and “art of 
teaching,” explained at the normal school and tried out in the adjunct model school. 
This latter art included individual instruction, simultaneous class teaching, and the 
so-called monitorial method by which “some of the more advanced pupils are 
employed to teach the less advanced” (Hodgins, p. 37). 

A third provision of Ryerson’s normal school was religious teaching consisting 
of weekly two-hour classes conducted by a visiting clergyman according to 
candidates’ denominations, plus mandatory attendance at associated Sunday 
services in the Toronto area. This practice constituted Ryerson’s answer to critics 
of allegedly Godless public schools or to those who feared public schools might 
theologically disrupt communities of particular religious persuasions. It was 
designed to convey a form of liberal Protestantism. Jessop, a Methodist, seems not 
to have found it inappropriate. Nor, for that matter, did those who would legislate 
into being the B.C. “secular” public school system it would be Jessop’s job to 
superintend. 

V 


To review, we first speculated on several contextual ways of becoming a teacher 
in precontact Indian society. Then we contrasted certain imported beliefs on the 
subject. One was that to know is to know how to teach. A second was that 
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paramount to teacher preparation are morality and religion. A third was that 
distinct from (though not, of course, without) knowing, teaching can be taught. It 
remains to suggest how, drawn from provincial historical data, such findings might 
help illuminate a history of teacher education if not alter forthwith the bearing of 
the whole subject as Butterfield would perhaps have wished. 


To begin with, on at least three counts, reference to Indian traditions about 
becoming a teacher is more than a timely political nod. For one thing, since they 
concern themselves primarily with schools, Whiggish chroniclers ignore the 
cultural diffusion of teaching in Indian society. In so doing, they fail to examine 
an important historical variable affecting subsequent institutionalized education of 
Indian children under missionary or governmental auspices. For another thing, the 
very diffuseness of these ancient aboriginal teaching roles obliges educational 
historians to ponder with some care the actual effect on becoming a teacher of this 
or that programme, given the formative influence upon any teacher of so many 
other life experiences. Not unconnected, there remains yet untold the story of 
preparing Native Indian teachers for service in public education. Toward this end, 
John Andrews wrote that “at the present time, there are only approximately 25 
certificated Indian teachers working in [B.C.] schools. If their number were 
proportionate to their population, there would be approximately 600” (Andrews, 
1974, p. 114). The Native Indian Teacher Education Program at the University of 
British Columbia and similar schemes elsewhere in Canada have been devised to 
help correct this disproportionate Native teacher representation. Their efforts 
stand to gain through historical understanding of becoming a teacher in past 
civilizations uncomplicated by white intrusion. 


Similarly, the cases of prototypical characters such as Staines, Lempfrit, and 
Jessop force educational historians to reconsider the Whiggish old story of public 
education and the normal school triumphant. Possibly because Jessop’s ideal 
world of publicly-funded, selective, prescriptive, morally-compelling teacher 
preparation ran parallel to that of Whiggish administrator/historians, the Staines 
and Lempfrits of the play remain twilight figures drowning off Cape Flattery or 
abandoning B.C. in favour of California. The inevitability of and sequential 
improvements through teacher examinations, normal schools, and education 
faculties colour ensuing accounts. 

In the case of Staines, what is lost is an Oxbridge heritage, namely, that the 
professional consequence of scholarship could be fellowship, tutorship or 
schoolmastership and that, as a function of scholarship, resultant ability to teach 
rests unquestioned. Nor is this a matter of foisting such a perspective on B.C. 
educational history. It already appears in the purview of independent educators 
who can write that “if we make it impossible in future to hire from the pool of 
unaccountable thousands who have excellent academic qualifications, and even 
outstanding teaching records, but no teacher training, we shall be denying 
ourselves some valuable opportunities.” It already provokes debates over the 
liberal and technical in teacher education. Thus, some observers stress the central 
importance of the universities. Others report that “university teacher education 
was universally alleged to have been useless” and to emphasize in its stead “a 
short period of intensive skills training . . . on how to handle yourself effectively 
in a classroom” followed by “a long period of apprenticeship, as much as three 
years” (Flanders, 1980, p. 7). In short, it is a perspective lacking in which any 
historical rendering of B.C. teacher education approaches caricature. 
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In Lempfrit’s case, educational histories affording him mere honourable 
mention thereby underestimate the historic place of religion in becoming a teacher, 
thus the magnitude of differences on the subject marking even so “secular” a 
context as that of British Columbia. Granted, Staines was schooled in accepting 
close Church/state/education ties. And both Staines and Jessop came to believe 
that the objective of religious education was moral rectitude. But Staines’ 
ordination was precipitate. And as with his peers, Jessop’s Normal School religious 
preparation proved relatively incidental, reflecting Ryerson’s belief in the value to 
future teachers of supposedly general religious truths culled from the scriptures. 
How different Lempfrit’s edification, all aspects of the process permeated by 
religious doctrine and observance. But as Wilson recently showed (Wilson & Jones, 
1980), old stories that see in B.C.’s free public nonsectarian schools the dawn of a 
better day tend to gloss over opposition from the private sector. By the same token, 
historians who, alluding to taciturn Canadian political leaders, can quip that they 
“may have preferred to let sleeping dogmas lie” are unlikely to appreciate the 
educational positions of Roman Catholic, Doukhobor, Mennonite, Christian 
Reformed, Seventh Day Adventist, or other church groups. As it applies to 
becoming a teacher, however, reorganizing the broad outlines of the theme requires 
not reducing but magnifying “conflicts and tensions” the better to understand an 
educational perspective which might wish to place salvation above patriotism, 
adjustment, citizenship, or economic security. In this regard, Downey’s incisive 
analysis, The Anatomy of a Policy Decision: B.C.’s Bill 33 — The Independent 
Schools Support Act (Downey, 1979), furnishes an arresting historiographic 
example of such a procedure. Articulating competing ideologies of public as 
opposed to independent educational alternatives, it points to the steady growth of 
denominational schools and to B.C. government concessions (support for texts, 
transportation, health services) initiated as early as 1951. No less pertinent to the 
issue of becoming a teacher, as research in detail Downey’s enquiry documents 
nearly 700 cases of permanent certification based on experience or on qualification, 
other than university teacher education, acceptable to the Independent Schools 
Certification Committee. 


Unlike Staines and Lempfrit, Jessop emerges from standard discussions of 
teacher education as a decided friend of progress, thanks in large part to Johnson’s 
attractive portrait of John Jessop: Gold Seeker and Educator (Johnson, 1971). 
Indeed, there is no denying Jessop’s fitness for this historical standing. If references 
to his correspondence and later Reports are any measure, he provides a direct link 
with the perception that teaching can be taught. After at least two decades of 
ceaseless professional recommendation to this effect, punctuated by tentative 
enabling legislation, he lived to see his vision of a Normal School in the style of 
Ryerson’s Toronto establishment at last realized, albeit modestly for a start. The 
perspectives for further research in detail are twofold: 


First, no small nostalgia surrounds the now-superseded normal school. Some 
lament the passing of its legendary capacity to train candidates in management 
skills, exercising discipline, classroom housekeeping, the orderly use of supplies, and 
the settling down of classes to quiet learning.'® In contrast to this encouraging 
recollection is Vancouver Normal School Principal William Burns’ record for 
posterity of unfulfilled repairs, insufficient lighting, complaints on admissions, 
underage students, weak discipline in the model school, scanty funds, theft in the 
cloak room, rude tram conductors, and other contretemps plaguing the 
administrative scene. Likewise in contrast are fifteen recommendations of the 
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Survey of the School System (Putman & Weir, 1925) implying low admission 
standards, short sessions, parochial outlook, isolation from the classrooms of the 
public system, inadequate practice, outdated curriculum, and dull lectures from 
underqualified instructors. Faced with this conflicting evidence about a model 
periodically invoked for the purpose of highlighting today’s conjectural 
shortcomings, historical researchers will want to reassess the early work of the 
normal school in order that such comparisons may be intelligently drawn," 
especially should present policy decisions depend on them. 


Second, the old story that moving teacher education to the university was an 
unmitigated advance leaves much unexamined. For the tale obscures resultant 
professional, and political friction, both on and off campus, so vital to 
understanding how participating teachers, trustees, ministry officials, professors 
and candidates perceive their rights, responsibilities, and rewards. It fails to query 
the consequences of leaguing ex-normal school personnel with freshly qualified 
educational academics as Canadian universities followed Alberta’s lead in the 
matter. Nor does it account for latter-day allegations that an education faculty 
could be “poised on the perimeters of the University’s intellectual life,” having 
acquired a “tarnished image” through its “Mickey Mouse” courses and research 
amounting to “organized trivia.”'* Patent, therefore, to revaluing the whole is 
recognition of the intense cultural significance of becoming a teacher that discord 
over its manifest perspectives should continue to resound so stridently, so 
frequently, and from so many quarters. 


Notes 
iL The general thought is from Bailyn (1960). This way of putting it I attribute to my colleague W. A. Bruneau. 
De Bridenbaugh joins Robert Middlekauff, John Demos, and Merle L. Borrowman in praising Cremin’s first volume 
of the trilogy. The publishers availed themselves of these endorsements. 
3. Other works interested in the “first teacher” question from the standpoint of conventional European teacher 


models include D.A. MacLean, Catholic Schools in Western Canada: Their Legal Status (Toronto: The 

Extension Print, 1923); M.A. Ormsby, British Columbia: A History (Vancouver, B.C.: Macmillan of Canada, 

1958); M.R. Lupul, “Education in Western Canada Before 1873,” in Wilson, Stamp, & Audet (1970); Phillips 

(1957); Johnson (1964). 

See The Committee on Teacher Education (1974). 

For fascinating graphic and textual presentations, see Stewart (1973, 1977). 

For my discovery of Spradley (1969) and Ford (1941) I am most grateful to my friend Peter Lando. 

Scholars differ as to whether or not Indian societies harboured a concept of class. Usher compared Drucker’s 

theory of finely-graduated statuses with Boas’ assumption of social class distinctions. For development of either 

theme see Philip Drucker, “Rank, Wealth and Kinship in North Coast Society,” American Anthropologist, 

January-March, 1939, 41, Number 1, 55-64, and Franz Boas, Tsimshian Mythology (Washington, D.C.: Bureau of 

American Ethnology, 31st Annual Report, 1909-1910), both cited in Jean Usher, William Duncan of Metlakatla: 

A Victorian Missionary in British Columbia (Ottawa: National Museum of Canada, Publications in History, 

Number 5, n.d.). 

8. These prototypes are by no means presumed to exhaust the subject. For example, the preparation of women to 
teach in the many B.C. private schools, religious or otherwise, might very well be a rewarding topic to explore. 
Alison Prentice, Neil Sutherland, and Juliet Pollard have drawn this matter to my attention. 

9: As Rothblatt observed in his The Revolution of the Dons, the donnish attitude to professionalism such as 
Staines would have encountered at Cambridge rested on an “ideal of service” antithetical to trade or industry 
with its concern for profit, the market, advertising, and competition. Staines was not just a person broadly 
educated by standards current in his day. He was also a professional — a priest ordained for chaplaincy; a 
Cambridge man schooled for teaching. At Fort Victoria, he sustained a head-on collision with a pioneer branch 
of the Hudson’s Bay Company, one of the world’s most powerful joint stock companies. Accordingly, Staines’ 
eventual clashes with those responsible for the H.B.C. cannot be caricatured only as the anarchical capers of a 
political agitator. They were as well the impact of a resolute Victorian professional upon resolute Victorian 
traders who adopted different vantage points from which to survey their 19th century colonial world. 
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10. The account of Staines is drawn from the generously-documented article by Slater (1950). I am much indebted 
to this author. The quarrel between Staines and Douglas was complex, but essentially stemmed from differences 
concerning three sorts of matters. The first was what Staines alleged to be Company lethargy in providing such 
facilities as housing, a school, a church, a burial ground. The second had to do with Douglas’ charge that 
Staines overstepped his authority upon occasion and meddled with Company business. The third Slater has 
astutely characterized as conflict between the Company as fur-trader and the Company as colonizer and the 
effect of this conflict on land prices. 

Die I am indebted here to Sister Mary Down (1962). She in turn has consulted data from the Archives, Missionari 
Oblati Di Maria Immacolata, Via Aurelia 290 Roma (629), Italia. Doubtless these resources hold further clues to 
Lempfrit’s background. 

12 See Hodgins (1911). In 1844-1845, Egerton Ryerson visited and admired normal schools in France, Holland, 
Austria, and Germany. London’s normal schools he considered excellent, though not so advanced as the Dublin 
counterpart run by the Irish National Board. He heaped high praise on similar institutions in Glasgow and 
Edinburgh, and later rendered an account of the impressive State Normal School at Albany, New York State. 

13. Jessop seems to have been more advanced than the general run of candidates in need of instruction in common 
school subjects. In May of 1853, he enrolled in the Toronto Normal School, having satisfied the authorities that 
he could read and write as well as do arithmetic, manage Latin and Greek besides, and was of good moral 
character. At the time, completion of one five-month session qualified candidates for the Second Class 
Certificate which Jessop achieved in October, 1853, leaving to teach in Darlington. Then in May, 1855, he 
re-entered the Normal School, completed a second session at high standing, and was granted a First Class 
Certificate which he put to immediate use teaching at Fingal and later at Whitby. I thank F. Henry Johnson 
whose John Jessop (1971) includes a succinct summary of this phase of Jessop’s career. 

14. See Alan Brown, Headmaster, and K.T. Stevenson, Board Chairman, St. Georges School, Vancouver, to Parents, 
October 14, 1977, p. 2. 

15. See Johnson (1964), p. 39. 

16. See Board of School Trustees, District Number 28 (Quesnel), Open Letter to all Faculties of Education in the 
Province of British Columbia (Quesnel: Board of School Trustees, July 21, 1974). 

Lie See my article “A Letter from Quesnel: The Teacher in History and Other Fables,” History of Education 
Quarterly, Summer, 1975, 15, 2, 131-145. 

18. Gilmour (1979). In his admirable chapter “History of Teacher Education in Alberta” (Jones, Sheehan, & Stamp 
1979), p. 1938, Robert S. Patterson has correctly explained the use of such data. “The intent,” he writes, “is not 
to deprecate the work and achievements of many individuals and groups, but to provide a perspective intended 
to make us more critical of our past with the hope that we will more appropriately address our current and 
future needs.” 


This perspective is based on an invited address delivered to the Canadian Association for Teacher Education, 
Université du Québec A Montréal, June 3, 1980. Thanks are due U.B.C.’s Faculty of Education for support of certain 
research phases. 
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Ethnography: An Adventure 
in Interpretive Research 


Patrick A. Duignan 
The University of New England 


The current emphasis on qualitative research in the social sciences has 
brought with it a plethora of terms to describe this form of research. One term, 
ethnography, which describes a research method of long standing in the field of 
anthropology, is receiving much attention in the literature on qualitative re-earch 
methodology. Ethnography, variously known as naturalistic description, analytic 
description, constructivist approaches or fieldwork is, however, not really a 
method but is more accurately described as an intellectual activity on the part of 
the researcher. This important distinction will be explained more fully in the 
following section. 


Ethnography: What Is It? 


The definition of Ethnography presented in this paper is that of Geertz (1975, 
p. 6) who suggested that it was, primarily, “an elaborate venture in ‘thick 
description’.” Thick description, Geertz explained, constitutes an “extensive 
description and interpretive effort at explaining the complexity of social 
discourse.” Acknowledging that the term “thick description” was coined by Gilbert 
Ryle, Geertz borrows one of Ryle’s examples to elucidate its meaning. This 
example so clearly illustrates what thick description means that it is reproduced 


here. 
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Consider . . . two boys rapidly constricting the eyelids of their right eyes. In one, this is 
an involuntary twitch; in the other, a conspiratorial signal to a friend. The two 
movements are, as movements, identical; from an I-am-a-camera, ‘phenomenalistic’ 
observation of them alone, one could not tell which was twitch and which was wink, or 
indeed whether both or either was twitch or wink. Yet the difference, however 
unphotographable, between a twitch and a wink is vast; as anyone unfortunate enough 
to have had the first taken for the second knows. The winker is communicating in a 
quite precise and special way: (1) deliberately, (2) to someone in particular, (3) to 
impart a particular message, (4) according to a socially established code, and (5) 
without cognizance of the rest of the company. 


Ethnography, therefore, if used synonymously with thick description, focuses on 
the meaning of behaviour. It is based on the premise that behaviour is not an end 
in itself; it is purposeful. Consequently, the ethnographer strives to understand 
the meaning actors give to their behaviours and actions. Explanations of human 
behaviour must take into consideration the meanings which those involved assign 
to their acts. Max Weber has pointed out that man is an animal suspended in 
webs of significance he himself has spun. The ethnographer tries to unravel these 
webs so as to understand more fully the “world view” of those he observes. The 
meanings others ascribe to their actions are, of course, not always self-evident and 
actors may consciously or unconsciously camouflage their real intentions. As 
Weber (1964, p. 111) so ably points out: “In the great majority of cases actual 
action goes on in a_ state of inarticulate half-consciousness or actual 
unconsciousness of its subjective meaning.” Kaplan (1964, p. 127) captures the 
essence of the argument that the ethnographer’s task is essentially one of a search 
for hidden meaning. Concerning the task of observation he notes: 


Observation is purposive behavior directed towards ends that lie beyond the act of 
observation itself . . . . When observation is thought of as passive exposure to 
perception, its instrumentality is left out of account .... In science observation is a 
search for what is hidden, not just because it ts hidden, but because its exposure will 
facilitate an intimate, sustained, and productive relationship with the world. 


The ethnographer, then, must endeavour to capture the essence of an event. It 
is not sufficient to record the event for its own sake; he must try to interpret the 
meaning of the event for those involved. He must try to discover both the 
apparent and unapparent import of actions and events. 


Geertz described ethnography as a venture in description and interpretation. It 
could perhaps be better described as an adventure — an adventure in interpretive 
research. The reader will, I believe, more fully appreciate the adventurous nature 
of this form of research after reading the following report on an observational 
study of the administrative behaviour of superintendents conducted by the author. 
The remainder of this paper focuses on the processes and procedures of the 
research and on the problems encountered in the use of this type of methodology. 
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An Observational Study of Superintendents 


The author conducted an observational study of the administrative behaviour 
of eight superintendents in the province of Alberta. The central problem of the 
study was to observe, describe, and analyze the actual on-the-job behaviour of the 
superintendent of schools so as to develop a composite view of the nature of his 
administrative behaviour. The focus, as in Geertz’ definition of ethnography, was 
on description of behaviour and then on interpretation of that behaviour. 


A field-study approach was employed to generate an analytic description of the 
nature of the superintendent’s work. Structured observation was used as the 
primary method of data collection. Mintzberg (1973, pp. 228-229) suggested that 
the structured observational method has certain characteristics that make it 
germane to the study of work content. He pointed out that this methodology 
combines the “inductive power of observation” with the “structure of systematic 
recording,” thus enabling the researcher to “study systematically and 
comprehensively those parts of managerial work that are not well understood.” 
This methodology allows the researcher to be more flexible in his observations and 
recordings allowing him to impose a structure during or after, rather than before, 
he collects his data. It combines the flexibility of unstructured (or open-ended) 
observation with the discipline of systematic selectivity of data. 


Data Collection 


Structural data were collected on the pattern of each superintendent’s 
administrative behaviour throughout every minute of each day during the 
observation period and on all his mail and verbal contacts. A number of steps 
were used in recording and organizing these data. Examples of these steps are 
given in the next sections. 

Observational notes. The details of the observed administrative behaviour of 
each superintendent were kept in a series of observational notes. Schatzman and 
Strauss (1973) recommended that a researcher develop a comprehensive set of 
field notes on his observations. These notes contained as little interpretation as 
possible and referred to the who, what, where and how of an activity. 

The next step in the processing of the data was the construction of three 
different records from the observational notes (adapted from Mintzberg, 1968). 
The chronology record described the sequence of administrative behaviour in 
terms of major categories of activities — units of behaviour. The contact record 
described each contact that the superintendent made and the mail record 
described each piece of incoming and outgoing mail. 

In addition, high-level inference data were generated through the use of 
theoretical notes (explained in a later section) and an analysis of the “purposes” 
for which an activity was initiated. 


Data Analysis 
Low Inference Level Data Ls . 
Profiles were constructed for the superintendents detailing the proportions of 
their working time devoted to each of seven major activities identified ‘ 
Mintzberg (1968) — unscheduled meetings, scheduled meetings, telephone calls, 
desk work, travel within their systems, evening meetings, and tours and visits to 
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schools. In addition, a composite picture showing these same categories was 
constructed for all eight superintendents. Profiles, using the composite scores for 
all eight superintendents, were developed to answer Guest’s (1960, p. 225) 
“deceptively simple classificatory question, Who does what, with whom, when, and 
where?” 


High Inference Level Data 


The high inference level observations consisted, primarily, of two types of data: 
(1) theoretical notes developed by the researcher and (2) categories of purpose 
developed to describe the purposes for which the various verbal and written 
contacts of each superintendent were initiated. An explanation of the development 
of these two analysis techniques follows. 


Theoretical notes (TN’s). Schatzman and Strauss (1973) recommended that a 
field researcher develop theoretical notes as well as his observational notes during 
and after the observational period. In the field situation the researcher is 
constantly trying to make sense out of what he observes. One method for doing 
this is to develop tentative working hypotheses to provide a number of foci for the 
mass of data recorded from the observations. 


It should be noted that data collection and data analysis are not necessarily 
distinct and separate phases in field research. The field researcher usually engaged 
in data analyzing activities early in the research process. While writing the 
observational notes, the researcher also attempts to add to them some reflections 
and interpretations. Schatzman and Strauss (1973, p. 95) suggested that such 
attempts by the researcher “represent self-conscious, controlled attempts to derive 
meaning from any one or several observation notes.” Through the development of 
theoretical notes the researcher strives to make conceptual sense of observed 
events. 


On the one hand, these notes may consist of a detailed elaboration of a 
particular inference or, on the other hand, they may constitute an attempt to link 
a number of previously stated inferences into a more abstract statement. The 
development of theoretical notes is equivalent to the process of data interpretation 
referred to by McCutcheon (1978, p. 11) as “patterning.” She described patterning 
as “an attempt to see how each discrete fact relates to the total matrix of other 
facts collected in the . . . situation.” By constructing patterns of observed 
activities, McCutcheon suggested that “discrete actions are given coherence, form, 
and meaning; they do not remain random isolated acts.” 


The theoretical notes, therefore, constitute a record of the development of the 
researcher’s thoughts about the meaning of the observed data. They often note 
and describe events that have been frequently observed. Schatzman and Strauss 
(1973, p, 109) suggested that “the systematic development of theoretical notes can 
be thought of as a preliminary analysis.” 

The purposes for which verbal and mail contacts were initiated. The analysis 
of the purpose for which each contact and mail item was initiated constituted an 
important part of the analytic description of the nature of the superintendent’s 
work. Mintzberg (1973, p. 245), in evaluating the importance of the analysis of the 
“mail and contact purposes” in his study, stated: 
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The key to our study was the categorization of the purpose. This describes the 
essential content of managerial activity — in raw form, what five managers did — and 
it is what led to the development of the theory on roles. 


The classification of purpose was a difficult task. The content of each contact and 
mail item was examined carefully and the question was posed: “What aspect of the 
administrative task is this concerned with?” The method used to fit purposes into 
the various subcategories of the superintendent’s behaviour was to allow the data 
to suggest a particular category and strive for consistency through the use of what 
Glaser (1965, p. 220) referred to as the “Constant Comparative Method.” He 
suggested that when coding an incident for a category the researcher should 
“compare it with the previous incidents coded in the same category.” This process 
was rigorous, demanding and time-consuming. It necessitated the rereading of the 
original field notes on numerous occasions until such time as the researcher 
became thoroughly familiar with the process. Even then, some of the new data on 
purposes would not “fit” into the established categories and, consequently, new 
categories had to be established to accommodate some of it. 


There were, however, additional data that defied classification. Such data were 
presented in their “raw” state to support specific conclusions. This was done in 
response to Geertz’s (1975, p. 20) caution that the ethnographer must be wary of 
trying to impose neat categories and classifications on all his observations. He 
suggested that human behaviour is very complex and all attempts to: 


Set forth symmetrical crystals of significance, purified of the material complexity in 
which they were located, and then attribute their existence to autogenous principles of 
order, universal properties of the human mind, or vast, a priori Weltanshauungen, is 
to pretend a science that does not exist and imagine a reality that cannot be found. 


Kaplan (1964, pp. 26-27) emphasized this same point. He suggested that the 
researcher’s imposition of a logical order on his data may be “too good to be true.” 
Quoting Scriven, he pointed out that we must be wary of: 


the logician’s perennial temptation — make the portrait neat and perhaps the sitter 
will become neat. Usually there is more to be learnt from a study of the disarray than 
is gained by intentionally disregarding it. 


Kaplan went on to suggest that what we see as a disarray is to the more 
perceptive eye “a style of dress both useful and elegant.” 

The development of the theoretical notes and the purpose categorizations 
constituted an attempt to derive inferences from the observations. The meaning of 
events to the superintendents themselves was consistently checked by the 
researcher in both formal and informal conversations. The research conclusions 
depended heavily on the researcher’s interpretation of behaviours and events and 
on a comparison of these with the interpretations of the respondents. The 
research was then a “venture in thick description’—an adventure in interpretive 


research. 
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This form of research has some definite advantages in conducting field studies, 
but there are also a number of problems associated with it. After a brief 
consideration of its strengths, I want to examine a number of its limitations and 
suggest some steps that a researcher could employ to minimize such problems. 


Strengths and Weaknesses of the Ethnographic Approach 


The research instrument in a study of this nature is really the observer 
himself. This fact is both a crucial strength and a crucial weakness. Kerlinger 
(1973, p. 538) captured the essence of this argument when he suggested that: 


The observer must digest the information derived from his observations and then make 
inferences about constructs . ... The strength is that the observer can relate the 
observed behavior to the constructs or variables of a study: he brings behavior and 
construct together ... . The basic weakness of the observer is that he can make quite 
incorrect inferences from observations. 


However, Becker and Geer (1957, p. 32) suggested that the observer who is aware 
of these problems of inference is well equipped to deal with them because “he 
operates, when gathering data, in a social context rich in cues and information of 
all kinds.” They argued that this “evergrowing fund” of information gives the 
observer “an extensive base for the interpretation and analytic use of any 
particular datum.” 

The ethnographer, however, despite Becker and Geer’s encouraging words, 
must be wary of presenting observed facts as self-validating; to do so is to take a 
great deal of licence with many established research principles. Data gathered 
through direct observation are still data despite the fact that the observer usually 
both gathers and interprets the data himself. Trow (1957, p. 35) pointed out that 
“the data he collects are not a substitute for the interpretative inference. We all 
forget that at our peril.” 


Schatzman and Strauss (1973, p. 53) suggested another advantage of the 
observation method. They suggested that in using an observational approach the 
researcher does not need to rely on “first impressions for his descriptive and 
analytic stimulation” but endeavours to sustain a continuing “de novo sensitivity 
and appreciation” of all occurrences. In this way the researcher can avoid jumping 
to hasty conclusions and he can test his developing hypotheses in the light of later 
events: he is free to move from data to hypotheses and back again. Lutz and 
Ianaccone (1969, pp. 115-116) stressed an additional advantage of this research 
method. They pointed out that through the use of an inductive approach the 
“classification of (observed) behavior is not forced into a preconceived scale or 
index of behavior.” In a sense, the data are allowed to “speak for themselves” and 
“categories of behavior can be modified and remodified in order to better account 
for the data and thus more clearly describe the process.” 

Other advantages of the ethnographic method, mentioned by many of the 
authors already referred to, are that: 


1. it allows behaviour to be recorded as it occurs, in its “natural environment”; 


2. it permits the recording of “the stream of behaviour” so that whole events 
are preserved; 

3. it allows the routine of the job to be observed and recorded. This routine, 
which may be a key to the understanding of the essential characteristics of a 
situation, is often missed out by other data gathering techniques. 
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There are, however, a number of problems associated with the use of the 
ethnographic method. A number of the most crucial difficulties and decisions 
facing the ethnographer or field researcher are discussed in the following sections. 


The Effects of the Presence of the Researcher 


The ethnographer is usually forced to “tread a thin line” between over 
involvement and complete detachment in his research effort. His delicate position 
is well illustrated by the anthropologist Alan Beal (cited in Spindler, 1970, p. 4): 


Except under extraordinary circumstances, there will always be a kind of clear plastic 
film separating the fieldworker from the rest of the community .... The fieldworker 
who behaves like a mechanical man, who treats others as subjects, or who coldly 
calculates the degree of friendship to be extended in each case, is going to appear less 
than human . . . . Perhaps the ideal fieldworker would be a person who learned to 
behave in a natural and predictable manner, involved but not strongly involved. 


The ethnographer’s role in the research situation is, therefore, a problematic one. 
Because he is typically a fixture in any observational setting, his actual presence 
may have an impact on the behaviour being observed. If the ethnographer takes 
care to be unobtrusive and avoids giving the impression that judgments are being 
made, then his effect as a stimulus can be minimized. However, no matter what 
precautions the ethnographer may take, he can never be sure that his presence is 
not affecting the situation he is observing. Ethnographers, and indeed all field 
observers, would do well to remember this fact. 

In this present study the researcher took a number of steps to try to minimize 
the effects of his presence. Because this method of data collection is so dependent 
on the permission, goodwill, and cooperation of each respondent, specific 
arrangements and conditions were established with each superintendent: 


1. each superintendent involved in the study was fully informed as to the 
purpose of the study; 

2. each superintendent was given the opportunity to check on and clarify 
interpretations of events made by the researcher; 

3. each superintendent was guaranteed anonymity in the research report; 

4. an agreement was reached with each superintendent at what stage the 
observer would leave a situation when it was delicate or personal in nature; 

5. the researcher tried to make as few demands as possible on each cooperating 
superintendent. Apart from occasional exchanges for the purpose of 
clarification and/or interpretation, each superintendent was able to pursue 
his work unencumbered by the researcher’s demands. 


Overall, the research was conducted in as nonthreatening a manner as possible. 
Every attempt was made by the observer to establish a trust relationship with 
each superintendent observed. 

Another problem related to ethnographic research, which the researcher may 
or may not be aware of in his own research, is generally referred to as researcher 
bias. 
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Researcher Bias 

The researcher may consciously or unconsciously mould his interpretations of 
behaviours or events to fit his own preconceived ideas or prejudices. The notion of 
bias is, however, often confused with the idea that ethnographic research should 
not be contaminated by the resarcher’s own experiences and _ theoretical 
perspectives. 

The first thing that should be noted regarding the distinction between bias 
and subjective knowledge and experiences is that no researcher can approach 
research as a tabula rasa. Kaplan (1964, pp. 132-133) quoting the words of John 
Stuart Mill makes this point very forcefully. He states: 


After the moment of the observer’s birth no observation can be undertaken in all 
innocence. We always know something already, and this knowledge is intimately 
involved in what we come to know next, whether by observation or in any other way. 


Our observations are, therefore, always influenced by our prior conceptions: we 
view the world through our very own perceptual lenses. These lenses have been 
partly ground by our exposure to and knowledge of theoretical constructs and 
formulations — if such exist — related to the research problem. There is no such 
thing, therefore, as atheoretical research. An observation is itself purposeful 
behaviour in that it is the product of active choice on the part of the observer. This 
choice from among a plethora of alternatives is guided by how we conceptualize 
the research problem. Geertz (1975) suggested that theory helps the researcher 
construct a framework for his analysis and interpretation of events; it makes mere 
observation of events intelligible. 

The issue is further complicated by the fact the the researcher imposes his 
own world view on his observations by the very language he chooses to describe 
them. Karl Popper (1959, p. 59) reminded us that there is no purely “phenomenal 
language,” no language to describe something sensed and not interpreted. Fearing 
(1954), commenting on the ideas (related to this topic) of the anthropologist 
Benjamin Whorf (cited in Von Bertalanffy, 1973, p. 235) supported this argument 
when he stated: 


the commonly held belief that the cognitive processes of all human beings possess a 
logical structure which operates prior to and independently of communication through 
language is erroneous. It is Whorf’s view that the linguistic patterns themselves 
determines what the individual perceives in this world and how he thinks about it. 


Nietzsche’s point that there is no “dogma of immaculate perception” is, 
therefore, well taken. The researcher is, in a sense, the slave of his own cognition. 
His interpretations will be coloured by his prior knowledge and experiences — his 
view of the world. The ethnographer, because he is the instrument for both data 
collection and interpretation, must, therefore, make explicit for his readers his 
own particular orientation or conceptualisation of the research. He must also 
endeavour to present, to the best of his ability, a fair and balanced view of the 
meanings of observed actions and events. He must acknowledge —if he is aware of 
them — any particular biases he possesses toward any of the people or 
occurrences in his research. More than that he cannot do! 
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The problem of bias is closely associated with two other problems which are 
particularly difficult to deal with in ethnographic research — problems of 
reliability and validity. These will be discussed in some detail in the following 
section. 


Reliability and Validity of Results 


The first point that should be made is that the accepted notions of reliability 
and validity as they apply to quantitative data can be difficult to apply to 
findings in ethnographic research. With regard to reliability, standardizing 
research procedures to meet the reliability criterion of replicability may well be in 
conflict with the need for flexibility and open-endedness in this type of research. 
However, while the use of a set of precise definitions may be restrictive in that it 
places brackets around data, it seems necessary to use the accepted theoretical 
descriptive language to facilitate communication and understanding of the results. 
Silverman (1972, p. 223) encourages field researchers to use the accepted 
descriptive language of the social sciences because everyday language has a wide 
variety of meanings. He suggested that if everyday meanings were used in 
academic discourse, they “might prevent discussion or replication of a particular 
work simply because colleagues might be troubled over the particular meaning 
implied by the vocabulary employed.” 


A further problem for the ethnographer is that another observer watching the 
same behaviours might very well produce a different kind of analysis. Inter-rater 
reliability tests are generally not possible for the lone observer working in the 
field. It is encumbent upon him, therefore, to explicate for the reader his methods 
and procedures of recording, coding, and analysis, so that the reader can judge for 
himself the appropriateness of procedures or categories employed. 

Mindful of the problems involved, the researcher in this study adopted a 
number of measures in order to increase its reliability. First, the researcher used 
systematic recording and coding procedures, adapted from Mintzberg (1973). 
Secondly, pilot observations and interviews were conducted with two 
superintendents to test the appropriateness of the recording and coding 
techniques and the interview questions. Thirdly, the researcher defined the unit of 
behaviour for the study and the specific dimensions of this unit were established 
for recording purposes. Fourthly, through interviews and impromptu meetings, the 
researcher checked his interpretation of events with each superintendent. Fifthly, 
much of the data was generated through low inference level observations and 
these findings were used to test the propositions derived from the theoretical 
notes and the categories of purpose. Finally, in the role of observer as 
nonparticipant, the researcher tried to maintain objectivity and minimize bias in 
the interpretation of the data. 

The ethnographer also has to deal with the question of the validity of his 
conclusions. Lutz and Iannaccone (1969, p. 124) suggested that once reliability is 
established, validity is not so much a question in field observations. They stated: 


As Kerlinger has said when writing about the field study method, “There is no 
complaint of artificiality here. “The observer is viewing the actual behavior. He is not 
one step away from the behavior as is the case when tests are used to measure 
perceptions of behavior. Rather, the field observer is looking at the actual behavior. In 
this method, if the data are reliable, they are usually valid. 
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Despite Kerlinger’s reassurances, however, the issue of validity in this form of 
research must be addressed by the researcher. Erickson (1978, pp. 1-2) made the 
point clear when he suggested that, contrary to the general belief, narrative 
description may be low in validity. He stated: 


description is no philosopher’s stone, with which in some magical alchemy, two 
elements — sensitive but entirely “unsystematic intuitions” ... and the ability to write 
good prose — are combined to produce insights into the nature of the events described 
— insights which are invariably “true,” or “relevant,” or “useful.” 


Validity is, of course, concerned with the credibility of conclusions. When we 
ask “Are the conclusions valid?” we are essentially concerned with the “reality” 
from which the findings were extracted — Have we portrayed the reality in our 
narrative description? Validity from this perspective is essentially concerned with 
truth conditions, with the judgment that the results seem to fit the reality. The 
degree of match between the reality and the representation of it presented by the 
researcher is generally referred to as face validity. 


The credibility aspect of this form of research is concerned mainly with the 
need for the researcher to present a solid package of evidence to support his 
conclusions. It is, however, not always easy to summarize qualitative data and 
present it in such a neat package. Because such qualitative evidence often is 
comprised of written field notes, video recordings, anecdotal remarks, and other 
such bulky data, it may be too cumbersome and disruptive of the flow of 
argument to present evidence supporting each conclusion. The ethnographer, 
therefore, tends to resort to presenting a series of characteristic illustrations from 
various sources to support his conclusions. Glaser and Strauss (1968, p. 229) 
provide us with a good description of how the ethnographer conveys credibility to 
his readers: 


So the researcher also ordinarily utilises several of a considerable armamentarium of 
standard devices. He can quote directly from interviews or conversations that he has 
overheard. He can include dramatic segments of his on-the-spot field notes. He can 
quote telling phrases dropped by informants. He can summarize events or persons by 
constructing readable case studies. He can try his hand at describing events and acts; 
and often he will give at least background descriptions of places and spaces. 


Because validity is so closely related to interpretation — the more the burden 
of interpretation assumed by the researcher the greater the likelihood of 
developing invalid conclusions — the researcher must make explicit both the 
processes and the evidence used in arriving at his conclusions. There is one 
cardinal rule that the ethnographer must adhere to: Whenever he feels unsure 
about an interpretation, or foresees that his readers may be uncertain, he should 
state explicitly the evidence upon which he is basing his interpretation. 
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One final point should be made concerning the credibility of interpretations 
and conclusions. Geertz (1975, pp. 17-18) warns that “coherence” of argument 
cannot be the major test of validity in ethnographic research. He points out that 
“there is nothing so coherent as a paranoid’s delusion or a swindler’s story.” Our 
interpretations cannot, therefore, depend only on the tightness with which they 
hold together, on the style of presentation or the logic or plausibility of the 
argument supporting them. Neither can they depend entirely on the richness of 
the description. Such description could constitute on the part of the ethnographer 
an attempt to “dress up” the data and provide them with a substance and 
robustness that they do not possess. The message then is that the reader of 
ethnographic reports must be on guard. Part of the responsibility for testing the 
validity of conclusions rests heavily on his shoulders. Just because an account of 
behaviours or events seems plausible is no guarantee that it represents the reality 
from which it has, supposedly, been taken. 


The researcher in this study was always conscious of his responsibility to 
generate valid conclusions. The reliability measures described earlier were adopted 
in part to help the researcher arrive at valid interpretations of observed 
behaviours and events. In addition, the development of theoretical notes, as 
described earlier, provided the researcher with the opportunity to check and 
recheck his conclusions and interpretations. The researcher also tried to minimize 
the effects of his presence as much as possible. Conclusions and propositions were 
reached through a process of what McCutcheon (1978, p. 12) referred to as 
“structured corroboration.” She suggested that, using this process, “many facts are 
shown to support one another in agreement about a particular conclusion. A fabric 
of interrelated facts, anecdotes, quotes and other bits of information is woven.” 
This use of many items of evidence, together with a wide range of evidence, 
allowed for interpretations to be made with a greater degree of confidence. 


.One final problem related to ethnographic studies is worthy of note. This is 
the accusation by many who are critical of its approach, that it is impossible to 
generalize from its results. 


Generalizing from the Results 

The rules governing generalizability in the logico-deductive tradition are not 
really applicable to this form of research. First, because the purpose of the 
research is usually to generate an in-depth analysis of a situation or event, the 
sample chosen — and it is not really regarded as a sample — is often small, 
perhaps one case only. In addition, the use of random samples is generally not 
appropriate. The point being made is that different rules apply to this form of 
research. To talk about representativeness of sample, or size of sample is to 
subject it to a set of research values that do not apply. In other words, to speak of 
a case study as constituting an N of 1 is meaningless. Geertz (1975, pp. 25-26) in 
discussing the ethnographer’s responsibility for generalizing, suggests that what 
generality he contrives to achieve “grows out of the delicacy of . . . distinctions, 
not the sweep of. . . abstractions.” The essential task of the ethnographer, 
therefore, is not to answer the reader’s questions but to make available to him 
answers derived from an interpretation of the behaviour of others in similar 
situations. 


295 


P. A. Duignan 


Conclusions 


I now wish to pick up the theme included in the title of this paper and 
addressed briefly in an earlier section — that ethnography is an adventure in 
interpretive research. It should now be apparent to the reader that ethnography 
is, as Geertz has stated, “a venture in interpretive research.” Why view it then as 
adventure? The author proposes that it is indeed a type of adventure — a safari 
into the jungles of social meanings. There are a number of reasons which come 
readily to mind that qualify such a method as an adventure. 


First, the ethnographer frequently steps into the unknown. Thick description 
is actually an attempt to map uncharted areas full of rich meanings which are 
waiting to be discovered. 

Secondly, it is an exciting form of research, as any adventure should be. The 
researcher is constantly alert, full of anticipation, wondering what will turn up 
next. 

Thirdly, it is an exploration. The nooks and crannies of each phenomenon, 
event, and experience have to be explored. New meanings are discovered and kept 
for a more detailed examination in the light of unfolding events. 


Fourthly, it is an adventure because it presents a challenge to the researcher, 
who has to make sense of the multiple realities he observes. This is no easy task. 


Fifthly, there are numerous pitfalls into which the unsuspecting researcher can 
plunge. The ethnographer can easily lose his direction in the amorphous mass of 
observations confronting him. He can also take the wrong direction in his 
attempts to derive meaningful inferences from his data, and thereby generate 
invalid conclusions. 


Finally, the ethnographer has to perform a very demanding and rigorous task. 
Those who criticize this form of research because it generates “only soft data” and 
who intimate that such research requires less expertise than “real” empirical 
research have an unfounded argument. The ethnographer, in fact, has to wear a 
number of hats. He has to be part historian, part psychologist, part sociologist, 
and part anthropologist. He is of course no less a scientist than his colleagues 
conducting empirical research, because he must apply rigorous and consistent 
rules and procedures to the various steps in his research effort in order to arrive 
at reliable and valid conclusions. 


Overall, then, the ethnographer faces an exciting and challenging undertaking. 
He encounters many problems, but with careful preparation and an intimate 
knowledge of the strengths and weaknesses of this form of research, he has a 
useful contribution to make in illuminating theory and practice in his field of 
endeavour. 
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BOOK REVIEWS 


RUTLAND STREET: THE STORY OF AN EDUCATIONAL EXPERIMENT 
FOR DISADVANTAGED CHILDREN IN DUBLIN. By Seamas Holland. The 
Hague: Bernard van Leer Foundation; Oxford: Pergamon Press; 1979, xv, 112 pp. 
ISBN 0-08-024264-2 


The Irish Department of Education and the Bernard van Leer Foundation at 
the Hague cooperated in setting up an educational experiment for disadvantaged 
children in Dublin in 1969. This book includes descriptions of the location 
selected for the experiment, planning of the project, background implementation 
of the educational programme, and community and home programmes developed. 
The design of the project and its findings are evaluated here together with short 
chapters on the reactions of parents and outcomes of the project. 


Disadvantaged children are defined in this book as those who, because of 
sociocultural factors, have limited knowledge and skills which, combined with 
particular attitudes, may result in adjustment problems on entering school and in 
learning difficulties in school. The characteristics of the area served by Rutland 
Street School, the school built for this experiment, include “high unemployment, 
large families, inadequate living accommodations, early marriage, marital 
disharmony amounting at times to ill-treatment and desertion, inability to match 
resources to needs, chronic ill health, inadequate knowledge of developmental 
needs of children, and a lack of understanding of the purpose of school” (p. 12). 


The developers of the educational programmes for this pre-school centre were 
cognizant of the assumptions, strengths and weaknesses of the American head 
start project, of Basil Bernstein’s work on the theory of restricted and elaborated 
linguistic codes, of the preschool playground movement in New Zealand, and of 
the preschool education approach in the British nursery school. Among other 
things relating to cognitive development and social maturation, it was decided 
that “language should be at the heart of the curriculum” (p. 29) and that the 
instructional objectives were to be in terms of pupil behaviour. 


The community programme included strategies such as Mothers’ Club, Advice 
Centre, informal visits by parents to the school, and formal meetings of teachers 
and parents. Concern for the importance of home environment in the social and 
psychological developments of children, particularly during the early years, 
resulted in the organization and implementation of a home programme. One 
dimension of this programme was to have mothers attend school “for one hour per 
week to work through a prepared lesson with their children under the supervision 
of one of the social workers” (p. 58). Another part of this programme, one more 
integrated with the educational experiment, was to attempt to change the home 
conditions of the disadvantaged children mainly by attempting to change the 
teaching style of the mothers. This involved teachers visiting the home and 
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teaching lessons in this setting. The Rutland Street School received international 
attention because it was the first European project to use home teaching as an 
intervention procedure to help disadvantaged children. The success of the initial 
programmes in this regard was such that it gave encouragement to embark on 
other similar programmes. However, the conclusion was that in order to achieve 
“significant results, programmes (of this kind) may have to be accompanied by 
effective action to ameliorate the harsh and unstimulating environmental 
conditions which seem to frustrate the best efforts of disadvantaged parents to 
establish satisfactory relationships with their children” (p. 64). 


The design of this project included a control group of “non-disadvantaged” 
children. The findings as outlined by Holland include the fact that the average IQ 
score had an initial gain and then a progressive decline. However, there was a 
tendency for less bright children to maintain or improve their position. There 
were also gains in performances on a test designed to measure achievement in 
areas regarded as necessary for success in school, but the experimental group and 
the control group had the same English reading quotients. 


Holland combines impressions gained from day-to-day interactions with 
parents and the results of an “objective survey” to indicate the positive reactions 
of parents to the project. It was found that mothers’ attitudes toward the children 
changed and they became more active in the educational process of their children. 
The outcomes of the project as presented in this book include a couple of other 
studies which were done in other cities in Ireland, the development of teaching 
strategies to engage children’s interest, and the development of an appropriate 
curriculum. Since the experiment was over, in 1974, summer courses involving 
parents have been offered for teachers in disadvantaged areas. 


Aside from the descriptions contained in this book, the author touches on (ch. 
12) some of the basic sociological assumptions in deciding to develop plans for 
intervening into the home lives of young children in order to improve their 
chances of becoming successful in school. Some may argue that there is need to 
change the societal system so that there will be a widening of the range of options 
open to the disadvantaged, while others see the greater need as one of educating 
people out of their poverty. On another vein of thought, the issue of measuring 
success is pointed to. “While the measured effects of the project failed to show 
marked improvement in the scholastic performance of the children, there are, 
nevertheless, some indications that the overall approach has much to recommend 
it” (p. 100). These indications are improvements in student attitudes and in the 
social atmosphere of the school, and increases in student willingness to learn and 
in parental awareness of the purpose of schooling. 

This is a useful little book. For those involved in planning intervention 
projects, it is a useful description of the complexities of the organization, 
implementation and assessment of such projects. On a more theoretical level, it 
could help one grapple with the assumptions often made concerning the 
sociocultural environments of the disadvantaged, their abilities to learn, and the 
well entrenched tenets of teaching. The biggest weakness of this book is one of 
omission. Specifically, except for a few brief references to parents (pp. 83-87), 
children (pp. 100-101), and social workers (pp. 49-53), we learn little of the 
experiences of the teachers, pupils, and parents who participated in this 
experiment. This omission is serious, because the author suggests that the success 
of the project lies in these experiences and resultant behaviours, and is not 
indicated in traditional IQ and academic performance benchmarks. 


Wilfred B. W. Martin 
Institute for Research in Human Abilities 
Memorial University 
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INTELLIGENCE: AN INTRODUCTION. By ae W. Pyle. London: Routledge 
and Kegan Paul, 1979, 123 pp., $10.50. 


This small book, of some 100 pages, is by a lecturer in an English college of 
education and is for “all those who are educationally concerned with developing 
abilities in children . . . who find more advanced texts difficult to read”. Thus it 
arouses interest for possible use in initial teacher-training in Canada. In 1979, two 
important books on Intelligence were published, one by P. E. Vernon and the 
other by H. J. Eysenck. They are, in their different ways, written with rare clarity 
and scholarship; they too are aimed at non-specialists. Even so, I find I can only 
prescribe them for my graduate classes; undergraduates in education can get a 
great deal out of them in time, if they are given a lot of help, but unfortunately, 
with their limited background and the pressures of their curriculum there is not 
the time. Hence the interest in Pyle’s new book. 

The titles of the five chapters are predictable. The first asks, “What is 
Intelligence?”, the second, “Can it be measured?” The next two deal with 
nature-nurture and development while the last is headed “Intelligence and 
Learning.” The layout is traditional, no tricolour printing, no pictures, cartoons, or 
other “aids to learning.” It requires of the reader about the level of general literacy 
and reasoning power of a somewhat above-average arts graduate. It was found 
very demanding for first year education students; in fact, much of Vernon’s book, 
which has the advantage of the meticulous and complete coverage of the literature 
so characteristic of his work, was no more difficult for them. 


Although the chapter titles of Pyle’s book appear familiar, the approach is not. 
Strongly influenced by information theory, there is much less concern than usual 
over individual differences of genetic origin and some undue optimism for the 
teacher’s potential to improve children’s intellectual powers. The definition of 
intelligence, reminiscent of people like Lloyd Humphreys, includes the whole 
repertory of learned skills, knowledge, processes, learning sets and so on which are 
acquired throughout life. The emphasis is on the development of learned 
strategies of thinking with the promise that, in spite of Coleman, performance 
contracting, Collins and Headstart, we may “start to make educational systems 
more effective, and improve the efficiency of children’s brains and their learning” 
(p. 69). The stance is thus at the other end of a continuum starting with Ertl’s 
speed of response of electric potentials and Jensen’s choice-reaction times. 
Intelligence tests, IQ, and age norms are denigrated; criterion referenced testing 
and mastery learning are approved. 


In spite of this, there is a brief survey of the contributions of Galton, 
Spearman, Burt, Thurstone, Guilford,and so on,but the sheer concentration of the 
material makes comprehension difficult for those who are meeting concepts for the 
first time. On the relative contribution of genetics and environment, a moderate, 
wholly acceptable position is stated, e.g., “both genetic factors and environmental 
factors are crucial,” but then the genetic contribution tends to be ignored. The 
Lesser, Fifer and Clark (1965) results, for example, are quoted insofar as ethnic 
group, social class and patterns of ability are concerned, but the marked ethnic 
differences in overall ability are omitted. Then the conclusion, “This amply 
demonstrates the effect which culture can exercise on the mental abilities of 
children” does not follow from “within each group the middle class children 
tended to have similar patterns only higher scores all round” (p. 57). Genetic 
influences were also there. Similarly, it is rightly claimed that low maternal IQ is a 
strong predictor of low functioning in a child, but it does not follow that the cause 
is only lack of stimulation and verbal encouragement at home. And the finding 
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that first-born children have higher IQs is presented as an environmental effect, 
ignoring Schooler’s suggestion that it may well be a statistical artefact. 


One-sided selection of evidence also mars other parts of the book, e.g., “many 
would accept the validity of the principle behind the findings” of the Rosenthal 
and Jacobson (1968) study. This, in spite of the admittance that the work was 
flawed. Too much weight is sometimes given to lightweight authors, e.g., Holt, and 
serious criticism of the positions held in the book is passed over. This is a pity 
because there is a lot here which is well written and provocative of serious 
thought. I myself would not wish to prescribe the book for use by students in 
education except, perhaps, for critical evaluation by PhD candidates with a good 
background in psychology. 


L. B. Birch 

Macdonald Professor 

in Educational Psychology 
McGill University 


LABOURING CHILDREN: BRITISH IMMIGRANT APPRENTICES TO 
CANADA, 1869-1924. By Joy Parr. London: Croom Helm; Montreal: 
McGill-Queen’s University Press; 1980, 181 pp., $25.95 


Since Neil Sutherland’s Children in English-Canadian Society (1976), 
students of the history of Canadian childhood have eagerly awaited this first 
major study of British child emigration to Canada. Partially based on Parr’s 
unpublished 1977 Yale University dissertation, this monograph demonstrates the 
application of new methods and analyses to Canadian social history. The heart of 
Labouring Children is a sample of 997 children and youths emigrated to Canada 
by Dr. Barnardo’s Homes from 1882 to 1902. The sample was developed by 
selecting every twentieth juvenile on the Barnardo’s sailing party lists, which 
included both Barnardo wards and children seconded by Poor Law Guardians for 
emigration. Dossiers containing relevant data were compiled for sample members 
before Parr developed a codebook with 52 variables, which allowed for 
computer-assisted analysis of the data. 

In addition to this quantitative data, traditional research methods were used to 
examine historical records and manuscript collections of emigration homes and 
public archives and in the analysis of relevant published primary and secondary 
sources. 

In four condensed chapters, Parr describes the conditions of British children 
most likely to be admitted to British rescue homes, the ideologies and practices of 
evangelical child savers, the children’s initial reception in Canada from 1869 to 
1900, and the social and moral consequences of emigration societies’ policy that 
separated children from parents, relatives, and friends. 

The next three chapters reconstruct the life cycle of child immigrants in 
Canada from apprenticeship or adoption to household membership and finally to 
adulthood and independence. Data gathered by traditional research methods are 
woven into the results of quantitative analysis to provide a collective biography of 
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the approximately 80 000 “home children” who settled in Canada between 1869 
and 1924. While much of the reconstruction of the rhythm and routine of rural 
Canada is unremarkable, some conclusions stand out. First, Parr’s judgment that 
the indenture system — especially if there were support payments for children 6 
to 10 years old — “did more to define the rights of British immigrant children 
than to extinguish their rights” (p. 84) demonstrates, at least in the case of 
Barnardo’s and the Catholic agencies, the pragmatic genius of the child-rescue 
societies. Secondly, adjustment problems were often related to the discrepancy 
between the relatively pleasant life of the British children’s institutions and the 
isolation, social distance, and harshness of Canadian rural life (pp. 101-103). 
Thirdly, Parr establishes that the movement of children from one situation to 
another — particularly in the case of boys — was often related to the necessity of 
matching the children’s economic value to land productivity. Boys, and girls less 
so, moved from least to most productivity agricultural lands as they progressed 
from being boarded out to earning their subsistence and schooling by helping with 
farm work to earning wages as farm hands (pp. 86-89). 

Despite Parr’s impressive research and the significance of juvenile emigration 
in understanding the development of Canadian social policies and practices in the 
twentieth century, the book is disappointing in its scope and insight. The dates in 
the subtitle are deceptive in that the developments after 1900 are not adequately 
analyzed, with only the short eighth chapter devoted to the twentieth century. 


Given Parr’s emphasis on evangelicals and their societies, one is disappointed 
by simplistic descriptions of their motivation and theology (pp. 35-37). Moreover, 
by curiously restricting “evangelical” to British Quakers and Plymouth Brethren, 
Parr distorts the concept so as to exclude members of other Christian 
communities (p. 105). Finally the literature on the British evangelical movement 
suggests that it is a major error to assume that the intensity of religious 
commitment and activity remains constant over one’s life, which Parr is inclined 
to do in reference to child savers such as Thomas Barnardo (p. 142). 


In a treatise filled with statistics, there is no clear breakdown of the age at 
emigration for the 997 children in Parr’s sample. This is a significant point 
because both contemporary observers and Parr criticized the practice of separating 
young children from family, friends, and homeland, and emigrating youths before 
they had reached the age of legal consent (p. 11). Were the vast majority of 
juvenile emigrants between 1869 and 1908 children, i.e., under twelve? Parr claims 
that the issue of child emigration was settled in March 1925 with Dominion ban 
on the movement of children under fourteen years of age which became 
permanent in 1928 and that “the boys and girls who came to Canada through the 
1920s and 1930s were teenagers searching for their first paid jobs who had 
completed their schooling and made their own decisions to leave home and family 
and begin a new life abroad” (p. 153). Such a statement is anomalous since in the 


four years, 1920-23, excluding emigration under the Empire Settlement Act, 5 100 
children were sent to Canada. Are we to understand these “children” to be the 
teenagers of the 1920s and 30s? It would also seem that 5 000 out of the 80 000 
estimate of emigration is not a trivial number. 


The problem of age at emigration also relates to three other issues. First, little 
children of ten years and under appeared as often claimed by emigration societies 
to have made a better adjustment to Canadian life than did adolescents (pp. 109 
and 115). Secondly, labour groups such as the Trades and Labour Congress 
opposed the movement not only because of potential urban drift as apprentices 
completed their indentures, but because many “children” were old enough to 
compete immediately with Canadian labour (p. 54). Thirdly, because Parr hails 
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the end of child emigration in 1925 and comments favourably on the fortunes of 
Barnardo immigrants in the 1920s who “were not typical child immigrants” (p. 
124), we need to know in detail the age composition of all children from 1868 to 
1924 and not a truncated Barnardo sample of 1882 to 1908. 


Parr’s favourable assessment of the 1920s emigration (the 1930s hardly count 
since the depression destroyed most of the demand from any kind of farm labour 
as well as opportunities for land ownership) is curious in that the most carefully 
orchestrated campaign of vilification of the emigration societies and their wards 
occurred in the 1920s and set the stage for the much admired ban on the under 
fourteen-year-olds. The remarkable aspect of the antijuvenile immigration 
campaign led by Charlotte Whitton and Canadian Council on Child Welfare was 
the crude mixture of solicitude for the circumstances of the children and 
unrelieved efforts to establish their degenerate hereditary traits and moral failings. 
Indeed, Parr’s unwillingness to confront adequately not the harsh treatment 
meted out to child immigrants by Canadian masters and mistresses or the severe 
emotional burdens of isolation and impermanence of placements but their 
persistent characterization as the offspring of the degenerate classes of British 
society is one of the study’s major flaws. Duly accepting Canadian complaints 
about the prevalence of enuresis, masturbation, and illegitimacy, Parr provides us 
with a detailed analysis of the medical history of the offenders (pp. 103-105 and 
115-117) but no analysis of the class and social thought that gave rise to the 
obsession. 


A second major flaw is that the study is whig history dressed up like 
revisionism. We are treated to some tantalizing hints that the “history [of the 
child emigrants] does reveal a darker side of evangelicalism, of turn-of-the-century 
childhood, and of Canadian rural life than the record has so far been allowed to 
express” and that the apprentices “certainly were over-worked, harshly treated and 
little more influenced by common school and child protection movements in 1924 
than in 1870” (p. 12). By the time Parr reaches the end of her discussion of the 
apprentices’ experience, she concludes that “the child immigrants were materially 
better off in Canada than they would have been in Britain” and that “the gains 
are clear, whether or not they compensated for the hardships entailed in early 
emigration. They were, however, far smaller than the reformers had predicted” 
(pp. 138-139). Rather than a dark tale, Parr provides us with an abstract story of 
the transformation of the concepts and practices in child welfare that put an end 
to the child emigration movement. 


On the Canadian side, the early critics of the movement, i.e., county sheriffs 
and gaolers, country doctors, Dominion members of Parliament, the Dominion 
Trades and Labour Congress, and urban middle-class reformers, had not only 
failed to generate the debate needed to improve substantially the conditions of the 
children but had instead “heightened the tensions of the apprenticeship years and 
left home children with a heavier burden to carry” (pp. 57-58). It is disappointing 
to note Parr’s bland description of the statements by these critics as “a heavier 
burden” when they characterized the children as having “syphilitic tendencies,” as 
being “imbued with crime since infancy,” part of Britain’s “undesirable 
population,” and carriers of “evil habits” (pp. 54-57). Similar sporadic criticisms by 
the Local Government Board, Boards of Guardians, and others evoked palliatives 
on the part of Canadian authorities (pp. 34, 51-52, and 145). But like Whig 
versions of common schooling, the notion of a true childhood for all children was 
an idea whose time would finally come in the twentieth century! 
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Moreover Parr argues that the growing political power of British labour cast 
child emigration in a new ideological light which culminated in the Bondfield 
Report of 1924 (pp. 145-147 and 153), while on the Canadian side, the standards 
of child welfare for Canadian wards had far outstripped the best efforts of the 
emigration societies (p. 150). Thus, she continues, by the 1920s, the scientific and 
professional interests had coincided with the advanced political and social views of 
British socialists (p. 153). The coincidence of interests and views, by bringing an 
end to child emigration in 1925, ensured that all children would have a childhood 
(pat2y 

Rather than a tale of advanced social ideas whose time had come, the history 
of child emigration is a complex story of human misery, religious philanthropy, 
imperial ideologies, human greed, and heartless and professional ambition. Of 
these we hear almost nothing in Parr’s antiseptic account of the Canadian side of 
the Home Children. Hopefully their ordeal will be told elsewhere. 


R. L. Schnell 
Department of Educational Foundations 
The University of Calgary 


TEACHER DECISION-MAKING IN THE CLASSROOM. Edited by John 
Eggleston. London: Routledge & Kegan Paul, 1979, 234 pp. 


This collection of papers, revised versions of those prepared by members of a 
working group of the Social Science Research Council, bears all the hallmarks of a 
set of conference papers. Each presents a particular view on the common theme, 
the process of teacher decision-making, with the editorial introduction attempting 
little more than a lively account of the role of the classroom teacher, the findings 
of the most pertinent recent research, and the particular aspects of the problem 
the individual writers have tackled. Having made that point, it is necessary to add 
that few working groups could be counted on to produce a set of papers as 
carefully argued and as calculated to encourage and direct further research. 
Moreover, few would evidence such a general concern that classroom teachers 
themselves should be drawn into this kind of research and that their suggestions 
and objections should be taken seriously, not dismissed as further evidence of the 
conservatism of an allegedly conservative profession. 


The emphasis is on empirical research, based on an examination of the context 
in which teachers make their decisions. The most thorough-going attempt at 
clarifying “the nature and incidence of decisions and the complete network of 
variables that surrounds them” is that of Sutcliffe and Whitfield, who set the tone 
of the volume by their insistence on the need for a definition of decision-making 
that is “logically consistent, empirically testable and empirically justifiable.” Their 
stance is ably defended, but it seems a little odd that in their consideration of 
teaching, consideration of the (at least) complementary activity of learning is set 
aside, as becoming relevant only when the “quality” of teaching is being examined. 
Admittedly, the plethora of variables makes any research into teacher-pupil 
reaction fiendishly difficult and Sutcliffe and Whitfield are to be commended for 
the very precise delineation of the area of their investigation. Nevertheless, as 
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they themselves point out, any research in the field of teacher-pupil interaction is 
influenced, however subtly, by the researcher’s notions of effective and desirable 
teacher behaviour. That being so, it is a pity that the case for regarding learning 
as a central characteristic of that particular form of interaction called teaching 
was not examined in detail. The disappointment is all the greater because the 
analytical introduction to their account of their research project is both fully 
developed and an integral part of the argument, taken as whole. (In contrast, 
Bellaby’s important, provocative paper, “Towards a_ political economy of 
decision-making in the classroom,” seems somewhat contrived, the theoretical 
statement of the introduction being linked rather awkwardly with an account of 
painstaking research which could almost have been presented independently of 
the preceding section, were it not that the alternatives presented in the conclusion 
presuppose acceptance by the reader of the position taken in that preliminary 
exposition.) This paper by Sutcliffe and Whitfield is also noteworthy for the 
thoroughness with which the research methodology is explained and the attention 
of other workers is drawn to the many fruitful if complex areas that demand 
further investigation. Anyone interested in either teacher training or research into 
aspects of classroom teaching would find this paper valuable. 


Another particularly interesting paper is that of Finlayson and Quirk. 
Although their study of the social processes affecting both teachers and those who 
try to influence them necessarily draws on English examples, their argument is 
equally pertinent in other contexts. Neither the confusion of slogans with 
programmes and of both with the justification of particular aims, patterns of 
school organization, or methods of teaching, nor the sense of failure that saps the 
energy and self-confidence of trainee teachers (and, at times of dramatic change in 
curriculum or methods of teaching, that of many experienced teachers also) is 
restricted to English experience. And the case made by Finlayson and Quirk for 
attributing part at least of the individual teacher’s disquiet to unsuccessful 
attempts to cope with a growing number of vaguely stated and sometimes 
conflicting role expectations is most convincing. The problem tackled by Finlayson 
and Quirk is given a further dimension by David Hargreaves’ stress, in his paper, 
“A phenomenological approach to classroom decision-making,” on the extent to 
which teachers’ handling of day-to-day decisions reflects not only their teaching 
style and ideological commitment (in the professional area) but also the whole 
range of their social values. Just how values are related to classroom practice is, of 
course, far from clear; perhaps, as Hargreaves suggests, an exploration of 
decision-making in the classroom situation will help to clarify and explain the 
nature of that relationship. Certainly if we could see “how values are embedded in 
practice” we might be able to help both beginners and experienced teachers to 
understand better their own practice. Various other papers suggest ways in which 
that examination could gain a new perspective: Keith Drake uses the economic 
model of choice to stress the need to remember the benefits renounced by the 
decision-maker at the moment of choice and hence the desirability of estimating 
the cost of any extra commitments in terms of renounced benefits; i.e., the 
benefits which could be expected to accrue from an alternative use of those 
resources. This underlines a point made earlier by Finlayson and Quirk; namely, 
that teachers cannot devote time and attention to additional tasks or additional 
time and attention to some existing task or tasks without sacrificing some possible 
other valuable activity. Drake pursues this theme, stressing the number of goals 
pursued concurrently by the teacher and the ways in which resources have to be 
distributed between them. Without denying the importance of the unconscious 
influences on decision-making stressed by Sutcliffe and Whitfield and, from a 
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different viewpoint, by Bellaby, Drake stresses the importance of the chooser’s 
perception of the context of his choosing and the, consequent need for further 
study of the ways in which teachers decide what possibilities are open to them in 
a given context, evaluate those options and decide between them. Drake’s view of 
the teacher as chooser frees the teacher from the deterministic model of 
neoclassical economics and leads Drake to the view that research methodology 
modelled on that of the natural sciences is unlikely to give sufficient recognition 
to the particularity and subjectivity of many classroom decisions. 

The more specific papers which are placed towards the end of this volume lose 
nothing by the careful limiting of their scope. Though varied in the stand-points 
from which they are written, they all demonstrate that aspects of teachers’ 
decision-making can even now be studied, that teachers’ perceptions of the 
classroom situation can be modified, and that the opportunity therefore exists to 
reduce the gap between what teachers seek to do and what in practice takes place. 


In summary, the papers in this volume show the scope of the problem — how 
teachers cope with the multifarious decisions of the classroom — examine it in the 
light of different and sometimes conflicting viewpoints, indicate fertile areas for 
further research, and show that even the most radical interpretation of our society 
leaves some room for effective individual action by the teachers. There is some 
repetition, and a more sustained attempt to identify and evaluate competing 
accounts of the nature of the problem would have been welcome; nevertheless, the 
collection as a whole is a valuable account of a concerted, first attack on a rich 
and complex area for research. 


Wilma Hannah 
Department of Education 
University of Melbourne 
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Before joining the Faculty of Education in 1959, Dr. Nelson served as a 
superintendent, principal, and teacher in Alberta. He has held short term 
appointments at McGill University, California State University, and the 
University of Minnesota, where he received his PhD in 1962. He was acting 
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JANET P. SIMPSON 
Alberta West Central Health Unit, Edson 


and 


JAMES MONROE STEWART 


Tennessee State University 


Expressive English Syntax in Canadian 
French-Immersion School Children 


The purpose of this study was to examine some specific English expressive 
syntactic structures in a group of Canadian children enrolled in a French 
immersion educational program. The home, environmental, and native language 
of these children was English. Fifty-seven children enrolled in junior 
kindergarten through third grade participated. The Structured Photographic 
Language Test (SPLT) was used to evaluate syntax. No significant differences 
were found on overall mean and group scores for this group and the English 
monolinguals on whom the test was standardized. Item analyses for types of 
error revealed group differences. Implications for education and therapy are 
discussed. 


Since 1965, a number of French-immersion programs have been developed and 
implemented in Canada. In this type of educational setting, children from 
English-speaking homes and environments have attended schools that offer 
instruction entirely in the French language. These programs provide an 
opportunity for students to learn a second language (French) while they are 
acquiring and mastering their mother tongue (English). Unlike the situation in the 
United States, where the main objective of bilingual programs is retention of an 
immigrant’s native language, the goal of French immersion programs is to achieve 
native-like fluency in both languages. 


Janet P. Simpson (MS, Tennessee State University, Nashville, 1980) is a speech-language pathologist employed by the 
Alberta West Central Health Unit in Edson, Alberta. The study reported here is one of several research projects initiated by 
Ms. Simpson during her graduate matriculation. 

James Monroe Stewart received his PhD in Hearing and Speech Sciences from Ohio University (Athens). His primary 
interests are in speech perception with emphasis in distinctive features and their saliency and psychological reality and in the 
prevalence of communicative disorders. 
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Studies have been undertaken to examine the process of _ these 
French-immersion children in the areas of English reading, writing, spelling, 
arithmetic, cognitive skills, and receptive and expressive language skills. Generally, 
these students demonstrate “deficits” in their receptive and expressive English 
language abilities for the first several years, but perform at a level similar to that 
of their English-instructed peers by the third grade. More recent research 
indicates that these children outperform students in regular English programs by 
the end of elementary school (Maynard, 1980). 

In several longitudinal studies reported by Swain and Barik (1976), long-term 
deficits in the English language development of French-immersion subjects have 
not been found. Swain (1978) found that immersion subjects performed 
consistently poorer than their monolingual peers in English reading and spelling 
skills through the end of the first grade. Similarly, Marckworth (1978) found that 
bilingual children lagged behind in the acquisition of English morphological 
structures, reaching the monolingual norms at a later age/grade level. More recent 
and related developments have been outlined by Maynard (1980). 


French-immersion Programs 


Genesee (1979) reported on the scholastic effects of participation in the 
French-immersion program of the Protestant School Board of Greater Montreal 
(PSBGM). This program was among the first to be developed in Canada, and is 
currently the most extensive one in the country. In September 1977, some 9462 
children were enrolled in Kindergarten through Grade 11. Longitudinal studies 
were developed to make between-groups comparisons. English and French control 
students were matched on the basis of intellectual indices and socioeconomic 
status (SKS). . 


Typically, the children enrolled in PSBGM and similarly structured programs 
reported on by Swain (1978) and Sweet (1974) were of normal or above IQ and 
from English-monolingual, middle SES families. Genesee (1979) found that 
French-immersion children acquired parity with the English control subjects by 
Grade 3 in formal testing of all aspects of English language development, except 
spelling, where parity was obtained by Grade 5. They generally scored lower in 
reading, word discrimination, spelling, and vocabulary; fewer significant 
differences were found on tasks not requiring (necessarily) formal literacy, such as 
listening comprehension, aural decoding, and aural vocabulary and skills. At the 
same time, results of testing French language skills indicated that the immersion 
subjects had acquired “native-like” proficiency in these areas, including literacy 
skills. Other studies on French-immersion in Canada, by Swain (1978) and Sweet 
(1974), are supportive of these findings. 


All aspects of academic achievement, including mathematics, in terms of 
content material were found by Genesee (1979) to be similar and comparable for 
both groups. Similar results were obtained by Panunto and White (1979) for 
native-speaking Italian children who spoke English as a second language. Results 
of these and related studies indicated no long-term, negative effects in English 
language development or academic achievement among participants. 

The recurrent issue on French immersion is that differences are noted early in 
the formative years, but disappear as the students progress. What is not clear is 
whether the differences between the monolingual subjects and those enrolled in 
immersion programs area function of program exposure and/or progress or 
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maturation. More important is the nature and substance of the differences. Miller 
(1978) generates related dimensions in references to evaluation, assessment, and 
prognosis. All of these issues are germane to understanding bilingualism, but one 
issue is more directly related to the present study, which addresses a specific 
aspect of this problem. 


Perspectives on Bilingualism 


French-immersion programs are similar to the type of educational placement 
experienced by immigrants in the United States. For example, there are children 
whose native language is Spanish, and it is the only language spoken in the home. 
These children are being educated in a second language, that is, English. In this 
situation, the children are expected to learn the new language of the school and 
community; their native language receives no encouragement outside the home. 
The French-immersion children, on the other hand, are learning a new language 
that will complement the native language which is reinforced in the home, school, 
and community. There are, however, important differences between the two 
educational concepts. 

Cummins (1978) discussed several reasons why immersion programs have been 
successful when other bilingual programs have not. The French-immersion child is 
participating in an additive bilingual program. The acquisition of a second 
language is not making the native language less important. The immigrant child is 
participating in a subtractive bilingual program. The acquisition of a second 
language is imposed at the expense of the native language. Cummins states that a 
cognitively and academically beneficial form of bilingualism can be achieved only 
on the basis of uninhibited development of the native language. 

A discussion of the implications and consequences of additive and subtractive 
programs in bilingualism is beyond the scope of the present paper. The essential 
points here are (a) the recognition of how the second language is learned, that is, 
in reference to the native language, (b) its social encouragement, and (c) the 
environmental influences. These are important variables in the acquisition of a 
second language in bilingual programs. These variables are confounded for those 
children whose second language is still in the developmental stages or whose 
native language is deficient or faulty (see also Glass, 1979; Miller, 1978). 

Syntax. Hamayan, Markham, Pelletier and Tucker (1978) examined 
English-speaking bilinguals in an immersion program for their French language 
competence. To accomplish this, 60 fourth- and sixth-grade children were tested 
by sentence repetition or elicited imitation. The bilingual and French monolingual 
groups each contained 30 children with equal numbers in each grade. By observing 
errors in the second-language learner, the authors hoped to make inferences about 
their developmental stages. They felt that the syntactic structures not available in 
the children’s competence would be distorted in their production. 


The results indicated that the performance of the bilinguals was different and 
not as proficient as the French group in their usage of the sentence structures. 
The difficulty and inefficiency on the task were more profound in the bilingual 
than the French group. For both groups, sixth graders performed better than 
fourth graders, which reflects an age-maturity factor. Based on the methodological 
controls, the authors felt that the differences between the two groups were a 
function of differing internal grammars. Consistency of use characterized both 
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groups, but the French-monolingual groups utilized the various syntactic 
structures like the adult native French-speaker more than the bilingual group. 


Ramstad and Potter (1974) investigated the use of receptive vocabulary and 
receptive and expressive syntax between two groups of children, Nez Percé 
(North-American Indians) and white children, attending the same kindergarten 
class. The Nez Percé population was in the process of becoming monolingual 
(English) and showed a need for English language instruction. Overall, white 
subjects performed significantly better than the Nez Percé subjects in receptive 
vocabulary on the Peabody Picture Vocabulary Test (PPVT) by Dunn (1965), and 
in receptive and expressive language as measured by the Northwestern Syntax 
Screening Test (NSST) by Lee (1971). The results indicated that the Nez Percé 
subjects demonstrated linguistic deficiencies in the areas of English prepositions, 
plurals, possessives, verb tenses, and possessive pronouns. 

Braun and Klassen (1971) investigated the oral syntactic structures of bilingual 
and monolingual children. The monolinguals showed superiority over bilinguals in 
terms of syntactic flexibility and control. Jorstad (1971) and Spector (1972) found 
that the syntactic abilities of bilingual children were less well-developed than 
those of their monolingual peers. 


In considering the results of studies investigating the syntax of bilingual 
children, Glass (1979) found that little information is presently available 
concerning the development of syntactic patterns of bilingual children. Even less 
information is available on bilinguals’ expressive syntactic abilities (defined by 
Lee, 1971, 1974; Werner & Kresheck, 1974), that is, subjects’ production of syntax 
as opposed to their competence or comprehension. The present study is an 
outgrowth of Glass’ observation. 


The literature on French-immersion programs indicates that there are no 
long-term deficits academically. On the other hand, the literature indicates few 
studies which attempt to identify the nature of the shorter-term deficits and/or 
linguistic interference associated with the language development of bilinguals. The 
few studies which have been conducted indicate that bilingual children 
demonstrate deficits in the acquisition of syntactic skills when compared to their 
monolingual counterparts. However, any conclusions regarding these children must 
be guarded, since normative data and translated versions of the measuring 
instruments utilized in these studies have not been standardized for the 
populations under study (see also Glass, 1979; Miller, 1978). 

The purpose of the present study was to compare and to identify the 
differences in expressive syntax of children enrolled in a French-immersion 
program in Kirkland Lake, Ontario to English-monolingual children enrolled in a 
suburban-Chicago school system in the United States. For the latter group, the 
normative data was compiled and evaluated from the Structured Photographic 
Language Test (SPLT) by Werner and Kresheck (1974), who allowed the use of 
their data and measuring instrument. 


Method 


Subjects 


Age and grade. Subjects ranged in age from 4-0 to 8-11 and were enrolled in 
grades junior-kindergarten (JK), Kindergarten (K), 1, 2, and 3 of the 
French-immersion program. Subjects in JK, K, and grade 1 received all instruction 
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in the French language. Subjects in grades 2 and 3 received 80% French and 20% 
English instruction. Subjects in JK and K attended school for half-days only. All 
subjects entered the program at the JK level. A total of 57 subjects participated in 
the study. Table 1 shows the distribution of these subjects by age, grade, and sex. 


TABLE 1 


AGE, SEX, NUMBER, AND GRADE OF FRENCH-IMMERSION SUBJECTS 


Sex 
Age M F n 
Junior - Kindergarten (15) 
4-0 to 4-5 2 4 6 
4-6 to 4-11 6 2 8 
Kindergarten (11) 
5-0 to 5-5 1 2 3 
5-6 to 5-11 4 5 9 
First Grade (7) 
6-0 to 6-5 il 2 3 
6-6 to 6-11 p 1 3 
Second Grade (11) 
7-0 to 7-5 3 3 6 
7-6 to 7-11 2 2 4 
Third Grade (13) 
8-0 to 8-5 3 7 10 
8-6 to 8-11 4 itt 5 
28 29 a7 


Note. Number of subjects in parenthesis for grade differs slightly 
from number in groups for age. 


School environment. All subjects attended Central Public School, which is 
administered by the Kirkland Lake Board of Education and located in a middle 
SES area of the town. Kirkland Lake is located in northeastern Ontario, about 48 
km from the Province of Quebec, with a population of approximately 14,000. 

Language background. English was the native language of all subjects and was 
the primary language spoken in the home. A second language (for example, 
French, Estonian, Ukrainian, and Polish) was identified by seven parents via 
questionnaire as being spoken “some of the time” in the home. 

Speech, language and hearing screening. The school was in the process of 
referring students with suspected speech and/or language handicaps to a 
consulting speech-language pathologist. None of the subjects had been placed on 
the referral list. In addition, all students were screened for hearing loss by a 
public health nurse before entering the school system. No subjects were referred 
for further testing. 
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Intelligence. The subjects were of normal or above normal intelligence as 
evidenced by their classroom teachers’ identification of satisfactory/unsatisfactory 
scholastic performance in school. 

Other information. An information letter and permission form were sent to 
each subject’s parent, along with a short questionnaire concerning: 
presence/absence of languages other than English spoken at home, time spent 
speaking French at home, and time spent watching French television or listening 
to French radio. The purpose of the questionnaire was to determine the students’ 
general homogeneity. No subjects were excluded as a consequence of evaluating 
parental responses. These were some of the variables deemed important by Miller 
(1978). 


Measuring Instrument. 


Background. The SPLT was developed to assess expressive syntax in the 
grammatical forms of regular, secondary, auxiliary, and copular verbs; singular and 
plural nouns; subject, object, possessive, and reflective pronouns; adjectives; 
articles; conjunctions, contractions, negatives; and prepositions. The test was 
designed to obtain spontaneous (self-generated) responses from children 4-0 to 
8-11 years old. Standardization of the SPLIT was _ conducted’ on 
English-monolingual, white, middle-class suburban children. The SPLT is based 
on linguistic and psycholinguistic studies of children’s acquisition of the syntactic, 
morphophonemic, and semantic rules of adult language. 


The visual stimuli are full color photographs of children, adults, and animals in 
everyday situations and activities. The auditory stimuli consist of specific and 
brief statements or questions which the examiner gives as the accompanying 
photograph is presented. Each structure is elicited by an introductory statement 
and a photograph. 


Scope and limit of the SPLT. The test was standardized on 360 children 
between the ages of 4-0 and 8-11 years. These children came from the middle SES 
where Standard American English was the sole language spoken. The children 
came from preschools and schools in the northern Chicago, Illinois area and were 
of normal intelligence with no evident speech or hearing problems. 


The authors reported that preliminary analysis of data of 25 children 
diagnosed as being language delayed by the SPLT were also diagnosed as 
significantly language delayed by the Development Sentence Scoring (DSS) 
Method (Lee, 1974). 


Rationale for use. The SPLT was selected because it met the needs of the 
present study. The purpose of the investigation was to compare and to identify 
differences in English language usage by French-immersion children relative to a 
standardized population. The SPLT allowed specific erred grammatical forms to 
be identified, and established a functional language age and expressive language 
profile (Werner & Kresheck, 1974). Other reasons for this test’s selection were its 
relatively quick and easy administration and its ability to elicit specific syntactic 
structures. 


Testing environment. The subjects were tested individually in a room separate 
from the classroom, approximately 4.5 months into the school year. As a matter of 
convenience, students were tested by class, starting with JK. The subject sat 


directly across a small table from the examiner. The room was well lit, and the 
perceived noise level was low. 
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Evaluation. Kach subject was instructed individually after introduction into 
the testing environment. The following set of instructions was given to each child: 
“I have some pictures for you to look at. I would like you to tell me some stories 
about them.” The first photograph was then placed in front of the subject. The 
examiner read the eliciting statement on the response form. The child’s response 
was then recorded in the space provided. If the child did not respond, one prompt 
(as outlined in the test manual) was given. If still no response, the examiner 
continued with the next item until all 51 items on the test were completed. 


Reinforcement. Verbal reinforcers such as “You’re doing a good job”; “I like 
your stories”; and “Okay” were given at various phases of the test. 


Statistical Analysis 


The investigation was considered a comparative study between the 
French-immersion subjects and the normative data published by Werner and 
Kresheck (1974). The original design of the study called for larger numbers in 
each age group; practically, this was not possible because of the lack of parental 
permission and the low number of children enrolled in the program. Results of the 
study were interpreted by age group because of the maturation variable in 
syntactic development. 


The Fisher’s zr transformation was chosen as the appropriate statistic because 
the use of multiple ¢ tests and analysis of variance would result in loss of 
interpretive power due to small sample sizes. The correlation coefficients for the 
normative data and those derived from the present study were obtained by using 
the mean response scores and standard deviations for each age group. The 
limitations of this study warranted a liberal alpha level, which was arbitrarily 
selected at the 0.05 level. 


Results 


Table 2 displays the normative data from Werner and Kresheck (1974) and the 
data acquired from the present study. The table reflects a comparison of the two 
groups by age. In this investigation, the subjects 4-0 to 4-5 scored a mean correct 
response of 64. This matches the score for the same age group in the normative 
data. For the next several age groups, 4-6 through 5-11, the subjects scored below 
the norms. From the age group of 6-0 and beyond, the subjects equalled or 
exceeded the normative scores. Generally, the standard deviations associated with 
the scores of the subjects were higher than those of the normative study. This 
generality is tempered with the size of the subsamples. 


Test of significance. A Fisher’s zr transformation was nonsignificant at the 
0.05 level. The nonsignificance was obtained with an r=-.91 for the normative 
data and an r=-.84 for the French-immersion subjects. These negative correlation 
coefficients reflect the relationship between the mean correct responses for the age 
categories and their respective standard deviations. From the results, it appears 
that the groups were equivalent. 

Item analysis. The percentage of correct responses grouped according to 
syntactic category and age for the French-immersion subjects is shown in Table 3. 
The criterion in the item analysis is based upon 90%, which was used with the 
SPLT. Syntactic categories in the table were generally ordered according to the 
attainment of criterion. For example, under the syntactic category of Imperative 
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(items 20 and 21), subjects in the 4-0, 4-6, 5-0, 5-6, 6-0, and 6-6 groups scored 50, 
68, 33, 72, 67, and 67% correct, respectively. For the remaining age groups 7-0 to 
8-6, subjects scored all items correct. It can be seen from the underscore at the 
age range of 4-0 to 4-5, that 90% of the subjects from the normative data scored 
this item correct compared to the 50% for the French-immersion subjects. An 
examination of the table indicates that, for the syntactic category of Reflexive 
Pronoun (items 38 and 39), subjects in their respective age ranges scored 0, 6, 16, 
11, 0, 38, 25, 62, 42, and 40%. As indicated, subjects in both groups failed to 
attain criterion by age 8-11. 


TABLE 2 


A COMPARISON OF WERNER AND KRESHECK (1974) 
AND FRENCH-IMMERSION STUDIES WITH MEANS, 
STANDARD DEVIATION, AND NUMBER OF SUBJECTS 


Werner & Kresheck@ French Immersion 

x SD n x SD n 
4-0 to 4-5 64 12 15) 64 14 6 
4-6 to 4-11 74 alee. 20 68 14 8 
5-0 to 5-5 79 9 30 70 Tah 3 
5-6 to 5-11 81 9 62 Tu ily 9 
6-0 to 6-5 80 8 47 82 3 3 
6-6 to 6-11 82 9 37 82 9 3 
7-0 to 7-5 86 8 23 90 2 6 
7-6 to 7-11 88 7 48 91 5 4 
8-0 to 8-5 93 5} 33 91 7 10 
8-6 to 8-11 92 3 15} 92 6 5 


“ Normative data published with permission of the authors and 
Janelle Publications. 


As a means to (solely) reduce data, items 1 through 6 were excluded from the 
table. These items, which included prepositions and singular and plural nouns, 
were excluded because the subjects exceeded the norms. It was felt that no 
important information would be lost or gained by their inclusion. Table 3 does, 
however, reflect the other items germane to these grammatical forms and are 
presented therein. 


The statistical analyis revealed no long-term, overall difference between the 
two groups, but differences in the obtained scores for some of the age categories 
were noted. Table 2 indicates where the differences lie. It reveals two major trends 
in the expressive syntactic abilities of the children under study in reference to the 
normative data. First, the scores of the subjects were similar to or better than the 
normative data in the areas of prepositions, singular and plural nouns, irregular 
present tense, past tense, infinitives, conjunctions, and possessive nouns. Second, 
at least a two-year lag was evident in a number of syntactic structures in 
achieving the 90% criterion. These structures include: imperatives, negative 
imperatives, possessive nouns, reflexive pronouns, third person singular present 
tense, negatives, yes-no interrogatives, and passives. 
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TABLE 3 


PERCENTAGE OF CORRECT RESPONSES OF FRENCH-IMMERSION 
SUBJECTS GROUPED IN SYNTACTIC CATEGORIES 


Item Age (grouped in 6-month clusters) 

Category From BELT 4-0 4-6 5-0 5-6 6-0 6-6 7-0 7-6 8-0 8-6 

Imperative 20,21 50 69 33 72 67 67 100 100 100 100 
Negative Imperative 22 67 Whey 67 100 67 100 100 100 90 100 
Possessive Noun 30 83 87 67 89 100 100 83 +100 95 100 
28,29 83 93 67° LOO! 100" 100" 100°” 200) 100 “108 

Reflexive Pronoun 40 50 50 0 67 33 67 83 100 90 80 
38,39 0 6 16 11 0 33 25 62 42 40* 

Y/N Interrogative 43,44 41 56 83 78> 7 LOOT 00" 1007) 100 95997 100 
41,42 50 58 50 61 100 83 84 100 100 60 

Present 7,8 91 68 83 67 83 67 83 100 85 100 
(3rd Person Singular) 2 100 50 100 89 100 # 100 100 100 100 100 
Future 9 67 100 267) 78 100 100 100 100 100 80 
11 50 62 67 LSE) LOO 00 00 75 90 80 

Conjunction 25 83 50) LOOk 00) = 00 67 100 100 100 += 100 
23,24 75 81 83) 5°2100) 100." 1005 200) 100! 100% > 1100 

Infinitive + Different Subject 33) 67 (icy villoye) 1897 100 100) -2'00'" 00> 200" 100 
Negative 35,36 8 63 16 Wyk 16 67 91 100 90 100 
34 Soe LOO 00 00 67 100 83 100 90 60 

Past 10 67 87 100 On LOO mm LOO 83 100 100 100 
ils yoilt sy aL 7) 33 50 67 67 76 67 89 58 93 93% 

Present (3rd Person Plural 14,15 83 75 67 67 100 83 92 100 100 100 
Infinitive 19 83 100 100 895 S100 10055 100 100 1001100 
18 50 62 100 89 67 67 83 50 80 80* 

Wh Question 45,46 41 43 67 5 5rLOO 67 100 = 100 90 100 
Participle 31,32 67 56 83 92 83) § 100 83 86 95 90 
Passive 37 0 12 0 33 0 0) 17 0 30 40 


__= age level at which 90% scored correct in normative data 
* = items on which 8-11 year children in normative data did not reach 90% correct criteria. 


Another and important observation can be made from the table. For all 
syntactic categories except conjunctions (and excluded items 1-6), there is a period 
of linguistic growth followed by a period of decline and finally a second period of 
growth. Although there are exceptions, this second stage of linguistic growth 
generally exceeds SPLT criterion of 90%. Two exceptions are found with 
negatives (34) and future tense (11). The pattern in these categories tends toward 
early correct usage and then decline. Normative data were not available for item 
analysis; therefore it was not possible to determine how much of this observation 
can be attributed to normal linguistic development and/or linguistic interference. 


Discussion 


The purpose of this study was to compare and to identify the shorter-term 
differences in expressive syntax of children enrolled in a French-immersion 
program to the normative data compiled by Werner and Kresheck (1974) on 
English monolingual children. From the statistical analyses, no overall difference 
was found between the data on French-immersion subjects and the normative 
data. With this major finding, however, it is important to note that unique 
differences between the groups did exist, along with some noted trends. 

The French-immersion and normative-data subjects initially scored similarly 
on the SPLT at the age range of 4-0 to 4-5 as shown in Table 2. The 
French-immersion subjects in this range are beginning the immersion program and 
their second language skills are not developed. At this point, they can be 
considered monolingual because of their minimal development of French language 
skills. This would account for the similarity in scores. 
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From the age range of 4-6 to 5-11, French-immersion scores were lower than 
those reported in the normative data. It is possible that some language 
interference as discussed by Lindholm and Padilla (1978) was causing this 
difference. Subjects at this age level were in JK and K and were learning and 
developing more extensive use of French. The program was concentrating on their 
use of French expressive syntax skills. Possibly, this was at the slight expense of 
the English skills. The differences, however, were within 2 standard deviations of 
the mean, which is considered within normal limits in the SPLT. Monolingual 
children obtaining these scores would not be considered language delayed until 
their scores were more than 2 standard deviations lower than the mean (Werner & 
Kresheck, 1974). 

At the ages of 6-0 and above, the French-immersion subjects were performing 
as well as or better than the subjects reported in the normative data; an 
observation made by Maynard (1980). This occurred even before English-language 
arts instruction commenced during the second grade. This finding supports the 
notion that the subjects have developed linguistic coping strategies as outlined by 
Ben-Zeevy (1977), which enable them to reduce and/or eliminate linguistic 
interference that may occur with respect to second language learning. 


The subjects’ performance in this study was also similar to those of 
French-immersion students in the PSBGM. In these schools, English language 
development skills of French-immersion children were comparable to their 
monolingual peers at the end of grade 3 (Genesee, 1979); the French-immersion 
subjects in this study compared similarly to their English monolingual peers by 
grade 1. Results of this study were also consistent with the findings of Swain 
(1978). In Swain’s study, as in this one, bilinguals performed similarly to 
monolinguals on certain aspects of English at one grade level and outperformed 
them at a later grade. In reference to the normative data of Werner and Kresheck 
(1974), better performance was found for the 7-year-old (generally second graders) 
bilinguals than for the normative subjects. This difference disappeared by the 
third grade, where subjects were functioning at a criterion level of at least 90%. In 
Swain’s study, these performance characteristics were seen in grades 4 and 5, 
respectively. 


Item Analysis 


Carrow (1972) reported that bilinguals show difficulty with receptive 
understanding of the syntactic areas of nouns, pronouns, plurality of nouns, and 
noun phrases with two adjective modifiers. Nez Percé subjects (Ramstad & Potter, 
1974) demonstrated linguistic inadequacy in the areas of prepositions, possessives, 
verb tense, and possessive pronouns. Subjects in this study did not demonstrate 
problems with these syntactic errors in expression with the exception of one form 
of verb tense, the third person singular present tense. 


Subjects in this study had not mastered the imperative category until age 7-0. 
However, in escorting and introducing the subjects to the testing area and the 
study, the examiner observed the correct use of the imperative tense in the 
spontaneous speech of many children in JK. It is felt that the discrepancy was 
due to failure of the test instrument to adequately measure this specific expressive 
syntactic category. The stimulus (photograph) was of poor quality — the 
photograph was dark and the items to be observed were not readily discernible. 
The verbal eliciting statement did not yield the desired response. Prompting was 
required in many cases before the subjects gave the correct response. 
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The passive tense showed the lowest overall mean percent correct for the 
French-immersion subjects. Although this construction was not observed in 
spontaneous speech by the examiner, it is felt that the scores were low because 
the use of passive voice is nonobligatory in adult language. The stimulus was a 
picture of a car involved in a motor vehicle accident. The eliciting statement was 
long and involved. Most subjects gave an appropriate response, but failed to use 
the passive tense, instead using active tense even after prompting. The 
photographs and eliciting statements for the two categories, imperative and 
passive, were the only items in the test instrument judged by the examiner to be 
inappropriate for eliciting the desired response. 


Clinical Implications 


Miller (1978) discusses issues relative to the bilingual. His concerns are 
consistent with the present study. At least three areas are worth delineating as a 
consequence of the results reported herein. 


First, the age-maturity factor should be considered. It is well known that 
language and communicative skills mature with age. With the bilingual, it is more 
than difficult to separate linguistic interference, immaturity, and variables 
unrelated to language and communicative development. Any assessments made by 
the professional should, initially, be guarded because of these variables. 


The second factor, not unrelated to age-maturity, is length of time in the 
French-immersion program. Since this study, like others, found no lasting 
differences, speech-language pathologists’ expectations of performance should be 
guided by existing and availabie norms. For example, the French-immersion 
subjects lagged behind the normative data up to the age of 6-0, at which time 
scores were similar to or petter than the normative data. A child from a 
French-immersion program thought to have a language delay should be reassessed 
around the age of 6-0. At this point, the delay should have disappeared. 


The third factor is the type of syntactic error. If the child is making errors 
similar to the ones made by the subjects in this study, there may be less cause for 
concern about a perceived language delay. However, if the child is making errors 
on items expected to be mastered at an age more than two years below the 
chronological age, there is need for concern. For example, a child aged 6-6 who has 
not yet mastered the prepositions and plural nouns as measured by the SPLT 
would likely benefit from language therapy. On the other hand, if a child of the 
same age fails to master forming the Y/N interrogatives, for example, it is likely 
that the item would be mastered at a later age without benefit of therapy. 


Children in a French-immersion program suspected of having a language delay 
should be closely monitored to determine progress in all areas of language 
acquisition. Speech-language pathologists in the schools generally have large 
caseloads and may need to accept clients on a priority basis. In this case, priorities 
can not be based solely upon severity and/or deviancy. As Miller (1978) indicates, 
one must consider the child’s total environment in order to understand the unique 
needs before developing the therapeutic plans, if any. 


Conclusions 

The results of this study support the hypothesis that there were no lasting 
differences in the English expressive syntactic skills of French-immersion subjects 
relative to the English monolingual subjects who made up the normative data. In 
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fact, the slightly higher scores, starting at age 7-0, may indicate that these subjects 
develop expressive syntactic skills at a faster rate than monolinguals. It should be 
noted that when compared with English monolinguals, the French-immersion 
subjects, starting at age 7-0 and above, made the same number of errors or fewer. 
An item analysis revealed that although the same number of errors were being 
made, the errors occurred in different syntactic categories. The French-immersion 
subjects revealed scores at least two years lower than the norms in the specific 
syntactic categories of imperatives, reflexive pronouns, present tense verbs (third 
person singular), negatives, Y/N interrogatives, and passive tense. 


Performance of expressive syntax was similar to the results of tests of English 
language development administered to children in similar programs. 


The success of this study can be seen in the consistency of trends with related 
studies. Obtained variances can be attributed to the specific area investigated, 
that is, expressive syntax. On the other hand, some variation, not necessarily 
adverse, can be attributed to the small sample size; this may have affected the 
reliability (in this case, accuracy) of specific scores. The SES of the subjects and 
the SPLT, which measured a different aspect of syntax, can account for some of 
the differences with other studies. 


It would have been more than helpful to have collected normative data on 
English monolinguals in the population for comparison with these 
French-immersion subjects. This problem is a major one and not unique to the 
present investigation. 
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Is There an Identifiable Stress Cycle 
in the School Year? 


From a random survey of 230 personnel in various role-groups of a large British 
Columbia school district, it is suggested that an identifiable stress cycle occurs 
with agreed pattern and regularity and is measurable and _ statistically 
significant. A meaningfully different stress cycle, over the school year, was 
identified by elementary as compared to secondary teachers. A broad review of 
the stress literature, beginning with Selye, is included. 


In recent years major attention has been turned to the phenomenon of human 
stress, not only in medicine and psychology, but in virtually every one of the 
health science professions. In Spring 1980, the British Columbia Teachers 
Federation undertook a province-wide survey of teachers’ opinions with a goal to 
forming a list of identified or perceived “stressors” in their work environment at 
the schools.! The study being reported here does not attempt to identify specific 
stressors, or causes of stress per se. Rather, we were concerned with the possible 
presence of a stress cycle, or rhythm from month to month during the school year, 
as it may be perceived by various professional and clerical personnel within a large 
school district. Viewing the school district as a system, we tested the hypothesis 
that there exists a regular and synchronous stress cycle within the ten-month 
school year, and that it is possible to identify and measure the cycle or cycles by 
having personnel within the system rate their own perception of stress from the 
perspective of their various role groupings. If there is such an identifiable stress 
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cycle in the system, based on the simple variable time of year, there may be 
worthwhile adjustments for the system by such simple maneuvers as elective 
special event timetabling changes. 


The findings of the survey appear to indicate that, at least within the studied 
school district, there is an identifiable stress cycle, with an agreed pattern and 
regularity which is measurable and statistically significant. The perceptions of 
personnel in several different roles congrue strongly on within-group percepts with 
respect to the cycle’s periodicity and amplitude, while they show between-group 
differences in perceived stress level, though not on periodicity. A striking 
difference appears between elementary and secondary teachers. 


Literature Review 


The original work by Hans Selye (1976) which centered on physiological 
responses to stress was pioneered in the early 1930’s and has developed into his 
thesis that the body has a General Adaptation Syndrome (GAS) which can be 
triggered by a wide range of environmental and mental stimuli: bacteria, noise, 
temperature, worry, conflict, excessive demands. Once the “alarm system” is 
initiated in the body, a fairly predictable biochemical sequence of events occurs. 
This may be summarized in Cannon’s fight/flight theory in which the body is 
mobilized for emergency situations, Selye’s Stage One. However, the body’s ability 
to tolerate arousal is limited. With prolonged activation, the body enters into 
Selye’s Stage Two, that of adaption or resistance, in which the body continues to 
adapt to the demands placed upon it. At this stage, the person usually feels alert, 
mobilized, and symptom-free. If continued adaption is required, the body finally 
enters into Stage Three, that of collapse during which the body becomes depleted 
and exhausted. At this point, we become susceptible to disease, illness, and other 
forms of dysfunction. 

Lazarus (1966), in developing his model of stress focussed not on the 
physiological process, but rather on the person and his reaction or response to a 
stressor, that is, the transaction between an individual and a stress event. This 
response is mediated by an appraisal network which consists essentially of 
cognitive and affective memory scanning. Based upon the degree of threat 
perceived, the individual then reacts holistically: that is, physiologically, 
psychologically, and behaviourally. 

Further work on the concept of stress has been pioneered by Holmes and Rahe 
(1967), in which they relate health changes to the history of recent significant 
changes in one’s life system (divorce, being fired at work, outstanding personal 
achievement). The more changes one experiences within a limited time frame, the 
less adaptive energy one has to handle stressors; consequently, there is an 
increased susceptibility to dysfunctioning, whether physical, psychological, or a 
combination of the two. At this point in the development of stress research, it is 
still not possible to predict which individuals will become ill or dysfunctional 
because of stress overload. However, the degree of life change has been related to 
such things as cardiac disease, minor illness, fractures, athletic and traffic injuries, 
depression and alcoholism (Petrich & Holmes, 1977). The current work of Kobasa 
(1979) suggests there may be a hardiness factor to help explain why some people 
become ill and others do not, even though both are facing a similarly high degree 
of stress. The work of Friedman & Rosenman (1974), in identifying Type A and 
Type B behaviour, suggests that those with a definite Type A Personality show a 
greater risk for cardiovascular disease. 
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The connection between psychosocial variables and health change is apparently 
very complex and multidimensional. Pelletier (1979), after analyzing the stress 
research, concludes that the effects of stress are dependent upon: 

1. the degree of individual reactivity to a stressor, 

2. the duration of reactivity over time, 

3. the background of long-term physiological arousal, 

4. whether the pituitary or adrenomedullary system is more activated by a 
stressor. 

In summary, research indicates that susceptibility to a stress reaction is 
holistic and is a function of the frequency and amount of stressors, of the degree 
of perceived threat of a stress event, of coping skills, inherent biological reactivity, 
diet, exercise, stage of life, and of social support systems. 


Research Method 
A survey questionnaire was circulated through school principals to a random 
selection of whole schools, with a return rate averaging 70% quite uniformly 

across role groups. Information was anonymously collected in respect to 17 

variables: 

1: Identification of respondent’s role group: Elementary classroom teacher; 
Secondary classroom teacher; Principal, Vice-principal or administrative 
assistant; Secretary in a school administration office; Janitors. 

2-13: A Likert Scale of 5 categories by which each month of the calendar year 
could be rated: very little stress, 0; some stress, 1; fairly stressful, 2; quite 
stressful, 3; highly stressful, 4. 

14: Years of education. Calculated with grade 12 counted as 12 and adding each 
completed year of university or postsecondary (e.g., secretarial) training. For 
example, grade 12 plus five-year BEd counted as 17 years education. 

15: Number of years employed in the present occupational role group. 

16: Age of respondent in years, as of the past birthday. 

17: Male or female. 


Kamloops School District 


The Kamloops School District serves a population area of approximately 
70,000 with 37 elementary and 10 secondary schools. The district employs 966 
teachers (50 of which are principals, vice-principals, and administrative assistants) 
and approximately 100 school secretaries. The survey was randomized by the 
method of selecting 5 elementary and 2 secondary schools from the whole district 
and then asking the principal of the selected schools to have each staff member 
complete the questionnaire. Specific requests were made to be sure that all staff 
had an opportunity to respond, including nonteaching personnel. The remaining 
schools (40) were sent a different request, through their principals, asking to have 
only the administrators, secretaries, and janitors complete the survey. An attempt 
was made in this way to obtain a random sampling of school district 
administrators, teachers, secretaries, and janitors. 

Initially, returns were obtained which differentiated principals from 
vice-principals and administrative assistants, but the 28 principals were pooled 
with the 16 vice-principals and administrative assistants as their individually 
calculated statistics on the five parameters of Likert Scale, years of education, 
years in role, age, and sex correlated highly (mean r=0.96). 
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ROLE GROUP NUMBER MALES FEMALES 
Elementary Teachers 90 a3 57 
Secondary Teachers 54 39 15 
Principals, Vice-Principals 44 40 4 
and Administrative Assistants 
School Secretaries 28 0 28 
Janitors 14 or 3 


Differential Perceptions of Annual Stress Cycle by Role Groups 


The five role groups depicted in Table 1 show differences in their various 
education attributes as might have been expected. The most significant aspect of 
the various role groups’ perceptions of the school year stress cycle is shown in 
Figure 1, where the Likert Scale ratings for each month are graphed. One of the 
most frequently asked questions by persons taking part in the survey was to know 
if the secondary and elementary teachers rated the year differently. Although the 
mean rating over the twelve-month period, comparing these two groups, is not 
statistically significant, the ratings for the months of August, October, December, 


Secondary ——— ‘ 
Elementary — — — / ai 
Principals --=-= wo 1 
\ 
| eet. a 


++. - + f+ —- 
Aug Sept Oct Nov Dec Jan Feb Mar Apr May June July 
Figure 1. Mean monthly stress perceived by secondary and elementary teachers and 


principals in Kamloops. 
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January, February, April, and June showed varying significance levels, as shown in 
Table 3. The inordinately high rating which June received by all role groups 
(slightly less by secondary teachers) may be influenced somewhat by the fact that 
the survey was administered during the latter half of this same month. 

It appears that although the three role groupings of secondary teachers, 
elementary teachers, and principals live and work in a commonly perceived 
periodic stress cycle, throughout the months of the school year, they differ greatly 
on the degree of stress variation from month to month. The elementary teachers 
appear to experience four strong peaks in stress during the year, while the 
secondary teachers experience two peaks, and with a less marked diminution 
between the two peaks. 


The school janitors and custodians were included in the survey, but showed no 
pattern that was statistically capable of differentiation from randomness. 
Nevertheless, their grouped data are included in Tables 1 and 2. 


TABLE 1 


CHARACTERISTICS OF PERSONNEL IN FIVE ROLE GROUPS 


12-month 
Years Years Totals for 
Role Group N in Role Education Likert Scale 
Secondary Teachers 54 36.796 Sig sia” 7S By? 18.685 
8.034 6.022 0.935 O032 
1.093 0.820, ORE27, 1.229 
Elementary Teachers 90 35.244 Si, 3167) 16.200 16.867 
6.895 5.948 ls Oe 7.810 
Op, O62, 0.180 0.823 
Principals, V-Principals 44 LD ISS) Op 227) 17.659 ISS AS) 
& Admin. Assistants 6.234 6.342 IS )s'3) Siylicali 
0.940 0.956 Opl>6 he SUS 
School Secretaries 28 39.893 7.214 uae Sy il 13.893 
9.418 3.695 0.997 10.282 
tL = 7Aske) 0.698 0.188 1.943 
Custodians & Janitors 14 Les) hint TG} S7fe 10.000 eyo al 
9.921 Bd Gull 1.468 6.592 
DY Moysyil 1.540 ONS 92 Oz 
Note. Statistics are: Mean 
See 
Die Lie ie 


Statistical evaluation of possible relationships between perceived stress level 
and all other reported variables failed to disclose anything meaningful, with one 
minor exception. A small inverse correlation between reported stress and age was 
found in the elementary teachers (r=-0.29). 
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TABLE 2 


LIKERT SCALE RATINGS FOR EACH MONTH BY ROLE GROUPINGS 


Role Group Aug. Sept. Oct. Nov. Dec. Jan. Feb. Mar. Apr. May June July 


Elementary 222 007 sa L256 Vadis 72056") i100) 2 1833) 2d al G56 2.811) 05133 
Teachers O41 e197, 0.955 1 L044 Lo173 10765) 15247 1.189 1690 
0.044 0.126 Deskeih Weal loleksyA OSTA OMe) OIA folks) Oa IAS. 1007/5) 


Secondary 0.259 2.2415 1.574 WD DO 685 O67 mer 870) pel IO 3a Ll 704g Od 5eueo 2270 Oe 30 
Teachers OR S2i LTAS eel Oll 8 09S) Tee 3e TOS 2 NPT. 182) el, 258ne 26cm leo so Net COO mmEOR O16 
OT LOC Old 5OmmmO G2) O49) On 158) sem OU 7mm Oed Gimmes 7 emmOed74 gene lam O edi Samo KOB4 


Principals, 0.409 2.000) 10237 2 13068 1.386 «1.227 1.636 2.545 0.068 
V-Principals, 0.622 1.201 0.876 OO Ze. O09 Lae Of O70 O72 V5039" 0,985) 9 16222) 1337 0255 
Admin. Assts. 0.094 0.181 0.132 0 O.147 0.157 O.149 0.184 0.202 0.038 


School OR dL 23279 058930 S57 © (07860 0) 5SGuuel 0715 ll. COO umealnazt 2.500 0.214 
Secretaries ee el SOG 200m 6 1.224 LOST ORIG2 5 225) 5 lesb Pei88) es 2915 405568 
OS 240) VOR2479" 0.230)" 05195 710.231 0119 5 0'.18. 2558 105230) OL e 24a On2255) 0.244 gem 0107 


Custodians, 0.214 0.714 0.500 0.643 1.429 0.929 1.000 071 1.000 0.643 0.8577 O2071 


1 
Janitors OTCO 200 825.m iO. 519 Q633 gp 222 207, VP. LOoF “0829 ee 1038 1.082 W292" 05267 
OFZ a ee O221 s O5139) 0.169 05327.) 0.5225 02 2965 eNO, 22 Ol 77 0 2090 S45) mC gat 
Note. Statistics are: Mean 
S.D. 
S.E.M 


TABLE 3 


COMPARISON OF LIKERT SCALE RATINGS OF EACH MONTH 
BETWEEN SECONDARY AND ELEMENTARY TEACHERS 


Month t ratio Confidence Level 
August 7.90 p<-001 
September 0.87 not significant 
October 1.67 p<.05 
November Ols79 not significant 
December Te 9 p<.05 
January 30D p<.002 
February 3.38 p<.001 
March 0.62 not significant 
April 2D 5h) jasoOit 
May OR not significant 
June ZieiLO p<.005 
July 0.03 not significant 


a ee SS 


Summary 

Pelletier identified reactivity to a stressor over time as being significantly 
involved in stress symptoms. This suggests the possibility of stress cycles among 
certain professions, especially those that have variable work demands placed upon 
them. The survey of professional and clerical personnel in one school district 
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i 


indicates that such a cycle is identifiable, and that this cycle is significantly 
different between elementary and secondary schools. 


Should further research support the findings of this study, there are practical 
steps which might be taken to reduce the stress overload on teachers, for example, 
elective or special events might be timetabled to low points in the identified stress 
cycle. 


Notes 


fi Correspondence with Nora Grove, B.C.T.F., 105-2235 Burrard Street, Vancouver, B.C., V6J 3H9. 
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Prospective Elementary Teachers’ Proficiency 
in Grade Six Mathematics 


This study was designed to discover the level of proficiency in grade six 
mathematics of students in a mathematics curriculum and instruction course, 
and to examine any relationships with instructional modes and with student 
attitudes toward and ranking of mathematics with other fields of study. A 
pretest and posttest were administered to the students, consisting of a 
mathematics inventory, an attitude measure, and a ranking scale. Results 
indicated: (1) a mean proficiency level of 60% on both pre- and 
postmathematics inventories, (2) no relationship between instructional mode 
and proficiency, (3) a significant correlation between attitude and proficiency, 
and (4) a strong relationship between ranking of mathematics and proficiency. 


The young adults who enter teacher training programs come in with a wide 
range of abilities and attitudes. Assumptions made about threshold proficiency or 
mastery levels of these students are often found by instructors in the course of the 
program to- be over-optimistic. Several studies have been concerned with this 
inadequate background knowledge in mathematics and have examined the effects 
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of instructional modes on this knowledge in courses in mathematics and 
mathematics curriculum. 

Reys (1968) investigated the understanding of contemporary mathematics by 
elementary education majors at the University of Missouri. Fifty-five percent of 
his sample scored below the median of year eight and nine pupils on a 
contemporary mathematics test. It might be expected that today’s teacher trainees 
have a better understanding than Reys’ study indicated, as they have been 
schooled in the “new” curriculum themselves. If not, then there is still a critical 
need for improving the mathematics preparation of elementary teachers. Other 
studies attempted to study the effect of a laboratory lecture approach in 
teacher-training courses. Fitzgerald (1968) concluded that students’ mathematical 
competence was unchanged by their laboratory experience with manipulative 
materials selected to complement the concepts considered in lectures. Warkentin 
(1975) found that a laboratory approach in teacher-training courses had a positive 
effect on attitude and the lecture approach appeared to increase course 
achievement. 

In an earlier paper, Effect of Mode of Instruction on Prospective Elementary 
Teachers’ Attitudes Towards Mathematics (Squire, Cathcart & Worth, 1981), the 
writers outlined a summary of attitudes to mathematics of 269 prospective 
elementary teachers enrolled in a mathematics curriculum course at the University 
of Alberta. The same group of students were also surveyed for their knowledge of 
basic mathematical skill and items commonly found in grade six mathematics 
programs and texts. 

The present report is about the same students’ responses on a specially 
prepared inventory of mathematics items, and examines the relationships with the 
instructional modes employed in the curriculum course and with the students’ 
attitude to and ranking of mathematics with other fields of study. 


Method 


Subjects 


The subjects were 269 prospective elementary school teachers enrolled in a 
one-term introductory course, Education Curriculum and Instruction 216, at the 
University of Alberta. The course was designed to examine concepts, topics, and 
instructional procedures in the mathematics curriculum of grades K through six. 


Instructional Groups 

The course was presented by five instructors who each had one group of 
students for the whole term of 13 weeks. One group (L), with 174 students, 
received three 50-minute lectures each week. The other four (S), with 30, 22, 25, 
and 18 students, had one lecture, a laboratory session, and a seminar each week. 


Instrument 


A 25-item Elementary Mathematics Inventory (E.M.I.) was constructed 
embodying aspects of the grade six mathematics curriculum. Each item was of 
multiple choice construction and was scored on I.B.M. answer sheets for machine 
scoring. 

The students’ attitudes toward mathematics were measured (as reported in the 
earlier paper) using an 11-point Dutton self-appraisal scale (D) and a 20-item 
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Likert-type scale (M.A.S.). Finally students were asked to rank the four major 
high school subject areas according to each of four criteria: “the most enjoyed at 
school,” “the most worthwhile to have studied,” “will be the most enjoyable to 
teach,” and “will be my most competent area in which to teach.” 


Procedure 
A pretest and a posttest were administered to the subjects in_ their 


instructional groups in the first and last sessions in the term. The same set of 
questions (described above) was asked on both occasions. 


Hypotheses 
The purposes of the study gave rise to four major hypotheses. 


Hypothesis 1. There is no significant difference between pre- and posttest 
scores on the E.M.I. for (a) each group, (b) groups 2-5 combined (S), and (c) the 
whole sample. 

A univariate analysis with the test as a repeated measure was used to test the 
three parts of the hypothesis. 


Hypothesis 2. There is no significant difference between large lecture group 
(L) and seminar groups (S) on the E.M.I. using pretest as a blocking variable. 

A two-way analysis of variance was performed using treatment (L and S) as 
the experimental variable and pretest performance on E.M.I. as a_ blocking 
variable. Three groups were identified by taking cutting points at +0.50 standard 
deviations from the mean on the E.M.I. pretest. This procedure yielded samples of 
approximately equal numbers of subjects with high, medium and low E.M.I. 
pretest scores for each treatment. 

Hypothesis 3. There is no significant correlation between mathematics 
attitude as measured by (i) M.A.S. and (ii) D and E.M.I. scores for (a) the whole 
sample, (b) each treatment (L and S). 

Pearson-Product-Moment correlation coefficients were calculated between D., 
M.A.S., and E.M.I. scores for each part of this hypothesis. 


Hypothesis 4. There is no significant relationship between scores on the 
mathematics inventory (E.M.I.) and the ranking of mathematics as (a) an 
enjoyable field of study, (b) a worthwhile field of study, (c) an enjoyable field in 
which to teach, and (d) a field in which the student feels competent to teach. 

A cross-tabulation (4 x 3 matrix) and chi-square analysis was carried out for 
each of the four cases. The same cutting points were used as in the Hypothesis 2 
analysis to identify high, medium, and low groups on E.M.I. scores. 


Results 


Pre- and Posttest Mastery Comparisons (Hypothesis 1) 

The mean scores for each group on the pre- and posttest of E.M.I. with F 
ratios and probabilities are presented in Table 1. On the basis of the computed F 
ratios in Table 1, Hypothesis 1 is accepted in all three cases. As indicated there is 
no significant increase in performance on the E.M.I. over the term. The average 
pretest score was 15 correct out of 25 and the average posttest score was 


essentially the same, 60% level of mastery. 
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TABLE 1 


ANALYSIS OF VARIANCE USING PRE-AND POSTTESTS 
AS REPEATED MEASURES 


Group N Pretest Posttest F Teast I 

u 174 14.42 14.31 0.023 0.881 

2 30 16.87 17.03 0.012 0.913 

3 22 $5 27 16573 0.544 0.469 

4 2 16.12 16.16 0.001 0.979 

5 18 15.61 14.72 0.176 0.680 
Zo 95 16.06 16.30 0.070 0.792 
J 269 15.00 15.01 0.000 1.000 


Treatment Comparisons (Hypothesis 2) 


The posttest means and variances for the treatment groups at each pretest 
level are given in Table 2. The number in each Lecture pretest group was 
approximately 60 and in each seminar pretest group, it was approximately 30. 


TABLE 2 


POSTTEST MEANS AND VARIANCES ON E.M.L FOR TREATMENT GROUPS 


Pretest Groups 


Low Medium High 
N xX S N xX S N x S 
Lecture 65 13.6 55 56 13.9 75 53 17.4 55 
Seminar 33 13.6 57 26 14.1 78 36 19.7 30 


A summary of the analysis of variance of E.M.I. scores due to treatment and 
pretest levels is presented in Table 3. As expected there was significant variance 
due to pretest groupings into low, medium, and high achievement levels, but no 
significant effect was indicated for treatment nor was there any significant 
interaction between treatment and pretest level. Hypothesis 2 was therefore 
accepted. 
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TABLE 3 


SUMMARY OF ANALYSIS OF VARIANCE OF E.M.L. SCORES 
USING PRETEST AS BLOCKING VARIANCE 


—_——_—_—..8 060. 0—0—MOSMN.T—osomsmamRhysu#sSs9 Ssuanngngngggg 
Source of Mean 


Variance ace Square F rene 
Treatment (A) 1 68.6 a A 0.279 
Pretest Level (8) 2 643.5 11.02 < 0.001 
Ax B 2 30.8 0.53 0.591 
Residual 263 58.4 


Relationship Between Mastery and Attitude (Hypothesis 3) 


The correlation coefficients among posttest scores for the whole sample and 
for treatment groups are shown in Tables 4 and 5. For the whole group of 269 
students, there is a very significant correlation between the M.A.S. attitude 
measure and the E.M.1. mastery score (p<0.001).For the treatment groups, there 
is significant correlation again between M.A.S. and E.M.I. scores at the 0.01 level 
for lecture treatment, and at the 0.001 level for seminar treatment. 

Even though there was significant correlation (p<0.001) between the two 
attitude measures (M.A.S. and D), no significant correlation is indicated between 
D and E.M.I. scores. Hypothesis 3 is thus accepted in relation to the Dutton 
11-point scale attitude measure, but rejected in relation to Aiken’s 20-item 
attitude scale. 


TABLE 4 


CORRELATION AMONG POSTTEST SCORES FOR THE WHOLE GROUP 


D M.A.S. EM. 
D 1.000 0.597% 0.117 
M.A.S 0.597% 1.000 0.292% 
E.K.L 0.11/ 0.292* 1.000 


* p<0.001 


Relationship Between Mastery and Ranking of Mathematics (Hypothesis 4) 


The nonparametric analysis employed to test Hypothesis 4 produced 
two-dimensional tables of frequencies of the four possible rankings available to the 
students versus proficiency categories on the E.M.I. pretest. A summary of the 


four tables so generated is given in Table 6. 
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TABLE 5 


CORRELATION AMONG POSTTEST SCORES FOR THE TREATMENT GROUP 


OF Bs is) 


0.387%% 


1.000 


D M.A.S. E.M.1. 
: S L Ss L 
D 1.000 1.000 0.531%% 0.683%** 0.167 
M.A.S. 0.531 0.683%** 1.000 1.000 0.272% 
E.M.1. 0.167 0.135 0.272% 0.387%* 1.000 
* p<0.01 
xx p<0.001 


A strong relationship can be seen between mastery level and ranking of 
mathematics. There was a general trend for students with a higher mastery 
(E.M.1.) score to rank mathematics higher on each of the four criteria. The 
number of students who completed the ranking exercise was only 183 (out of 269). 
This depletion resulted from 68 out of 98 of the Jow E.M.I. group filing a null or 
void return on this part of the survey. It could be inferred that if valid returns 
had been submitted by this group, an even stronger negative ranking would have 


been evident in the L column in Table 6. 


The relationship is established statistically by the chi-squares in all cases being 
highly significant (p<0.001). Hypothesis 4 was therefore rejected for all four 


ranking criteria. 
TABLE 6 


FREQUENCY TABLES FOR RANKING OF MATHEMATICS 
BY STUDENTS GROUPED ACCORDING TO PROFICIENCY 


; : J EMI Pretest Groups Totals 
Ranking criterion Rank L “ a N z 
N=30 N=68 N=85 (183) 
(a) Enjoyable 1 7 16 38 61 S525 
to study 2 7 14 20 41 22.4 
3 6 19 14 39 2153 
4 10 19 i is. 42 22.9 
(b) Worthwhile + 6 17 28 ei! 7 
to study 2 11 22 35 68 S722 
3 if! 21 13 45 24.6 
4 2 8 9 19 10.4 
(c) Enjoyable 1 4 18 38 60 3226 
to teach = 13 18 34 65 & Pe. 
3 8 26 9 43 pee 
4 5 6 4 15 8.2 
(d) Feel competent 1 3 14 35 52 28.4 
to teach 2 13 24 34 71 38.8 
3 12 24 13 49 26.8 
4 2 6 5 me 6.0 


* probability <0.001 
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Conclusions 


The survey indicated a number of significant findings, some of which have 
implications for the teacher educator. The mean proficiency level of 60% on a test 
of grade 6 mathematics seems hardly adequate for a beginning teacher. Since the 
study indicated no increase in proficiency occurring as a result of the curriculum 
course, perhaps a special compulsory course needs to be offered for students with 
less than, say, 80% proficiency in elementary school mathematics. Such courses in 
skills and content have been piloted recently in several Australian Colleges of 
Advanced Education with apparent success. Squire (1979) reports an overall mean 
gain of 19.8% in proficiency of 184 preservice trainee elementary teachers who 
undertook such a skills program of 1 hour per week for a semester. An overall 
posttest mean of 83% was encouraging, although certain areas of skills such as 
statistics and number properties still presented difficulty to 50% of students. 


In the present study the absence of increase in proficiency should not be 
interpreted as a weakness in the curriculum course undertaken by students. It was 
not specifically intended to upgrade skills in grade 6 mathematics, but one might 
have expected such a “spin off” from the examination of instructional procedures 
involving these same skills and concepts. It appears that such an expectation is 
not justified. 

Contrary to several other studies (Todd, 1966; Warkentin, 1975; and Cathcart, 
1975) this survey indicated that mode of instruction made no difference to level of 
proficiency. The test scores remained very much the same irrespective of whether 
students were instructed by three lectures a week or by a 
lecture-seminar-laboratory technique. This is not to say that various instructional 
techniques are not beneficial, for many studies have found positive attitude 
correlation with activity-laboratory methods (Cathcart, 1975; Fuson, 1975; Wall, 
1973; and Warkentin, 1975) when used with prospective teachers. 


A significant correlation between attitude and proficiency was shown in the 
present study, at least when the attitude measure was that given by the Aiken 
20-item, 5-point scale. This is interesting to note, even though Suydam and 
Weaver (1975) comment that there is no consistent body of research evidence to 
support an attitude-achievement correlation in school pupils in relation to 
mathematics. Aiken’s (1972) claim of reliability and validity for his scale when 
used with postsecondary students gives added weight to the present finding of 
attitude-proficiency correlation. 

The strong relationship between proficiency level and ranking by students of 
mathematics on all four criteria is further support for a link between attitude and 
profiency. The four criteria all express an attitude towards mathematics either of 
enjoyment, value, or feeling of comfort. It would seem important therefore to 
attempt to develop positive attitudes towards mathematics along the lines 
indicated by the ranking criteria if prospective elementary teachers are to develop 
mathematical proficiency also. This is not a new idea for mathematics educators 
but it is one that needs to be constantly activated. 

Two final comments on the study concern controlling instructor personality 
factor and testing a wider range of skills. If the staffing and timetabling situation 
enabled it to be done, the large lecture group and at least one of the smaller 
seminar-lab groups could have the same instructor. While the course presented by 
all five instructors was team planned and as uniform in content as possible, it is 
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readily admitted that some instructor bias could well have influenced results. A 
measure of control over this would be desirable, if not essential. 


Finally, the E.M.I. items spanned only five topic areas: numeration, rational 
numbers, spatial concepts, measurement, problem solving. Pirie (1979) advocates a 
list of 18 topics of skills and knowledge which must be mastered by a person who 
is to be classified as functionally numerate. A beginning teacher should be such a 
person, and a useful framework upon which to plan a program for preservice 
students might be the list of basic skills suggested by the National Council of 
Supervisors of Mathematics (1977). These would seem to embody Pirie’s 
suggestions which were based on the summary of the N.A.E.P. Mathematics 
Assessment by Carpenter, Coburn, Reys and Wilson (1975). 


References 


Aiken, L. R. Research on attitudes towards mathematics. The Arithmetic Teacher, 1972, 
19(3), 229-234. 

Carpenter, T. P., Coburn, T. G., Reys, R. E., & Wilson, J. W. Results and implications of 
the N.A.E.P. mathematics assessment: Elementary school. The Arithmetic Teacher, 
1975, 22(6), 438-450. 

Cathcart, D. C. The effects of activity instruction in mathematics on prospective elementary 
school teachers. (Doctoral dissertation, Indiana University). Ann Arbor, Mich.: 
University Microfilms, 1975, No. 75-16975. 

Fitzgerald, W. A math lab for prospective elementary school teachers. The Arithmetic 
Teacher, 1968, 15(6), 547-549. 

Fuson, K. The effects on preservice elementary teachers of learning mathematics and means 
of teaching mathematics through the active manipulation of materials. Journal of 
Research in Mathematics Education, 1975, 6(1), 51-62. 

National Council of Supervisors of Mathematics. National Council of Supervisors of 


Mathematics Position Paper on Basic Skills. Arithmetic Teacher, 1977, 25(1), 19-22. 
Pirie, I. G. Basic numeracy skills essential for the beginning teacher. Focus paper on 


mathematics skills in preservice courses. Proceedings of the THIRD BIENNIAL 
CONFERENCE of Mathematics Education Lecturers’ Association (MELA) of 
Australia, Feb., 1979. 

Reys, R. Mathematical competencies of preservice elementary school teachers. School 
Science and Mathematics, 1968, 68(9), 302-308. 

Squire, B. F. Diagnosis and planning for remediation for pre-service trainees. Focus paper 
on mathematics skills in preservice courses. Proceedings of the THIRD BIENNIAL 
CONFERENCE of Mathematics Education Lecturers’ Association (MELA) of 
Australia, Feb., 1979. 

Squire, B. F., Cathcart, W. G., & Worth, J. E. Effect of mode of instruction on prospective 
elementary teachers’ attitudes toward mathematics. Alberta Journal of Educational 
Research, 1981, 27(1), 35-45. 

Suydam, M. N., & Weaver, J. F. Using research: A key to elementary school mathematics. 
The ERIC Science, Mathematics and Environmental Education Clearinghouse, 
December, 1975. 

Todd, R. M. A mathematics course for elementary teachers: Does it improve understanding 
attitude? The Arithmetic Teacher, 1966, 13(3), 198-202. 

Wall, C. E. A study of the efficacy of a graduate mathematics methods course in changing 
inservice elementary teachers’ attitudes towards school mathematics. (Doctoral 
Dissertation, Michigan State University). Ann Arbor, Mich.: University Microfilms, 
1973, No. 73-12846. 

Warkentin, G. The effect of mathematics instruction using manipulative models on the 
attitude and achievement of prospective teachers. Journal for Research in 
Mathematics Education, 1975, 6(2), 88-94. 


338 


The Alberta Journal of Educational Research Vol. XX VII, No. 4, December, 1981, 339-351 


LARRY R. COCHRAN 


The University of British Columbia 


A Method for Monitoring Learning Activities 
and Experiences in Studying Prose 


This study is an investigation into the reliability, predictive validity, and 
potential usefulness of a method for monitoring unobservable events while a 
person is studytng. Sixty-six university students studied a chapter, rated the 
way they studied each of 10 sections of the chapter on 10 study variables, and 
completed two tests on the chapter. Subjects were also divided into five groups. 
Three groups were exposed to different types of adjunct questions and one 
group used a method for organizing comparative information. The final group 
was a control group. Findings indicated that self-ratings were reliable, 
predictive of learning, and sensitive to treatment condition. 


The general purpose of this study is to explore the reliability, sensitivity and 
validity of one method of monitoring unobservable events during study. If 
successful, the development of monitors could prove useful in specifying how 
treatment conditions gain their effects, in resolving inconsistencies in results, and 
in explaining failures to replicate. Monitors could also be useful in training 
programs where it is important to determine how reliably or accurately a person is 
executing a particular strategy of learning. 

As a concrete example, consider adjunct questioning (Rothkopf, 1966). In one 
variation, factual questions are inserted in text after the material to which they 
refer. Without looking back, subjects answer these questions and then continue 
reading. When subjects read the next section, they tend to learn more. Factual 
postquestions increase learning in subsequent sections. However, there have been 
marginal results (and consequently, failures in replication) and inconsistent results 
(e.g., Marton, 1976; Moeser, 1978). How can these difficulties be satisfactorily 


resolved? 


Larry R. Cochran(PhD, University of Utah) is currently Assistant Professor at the Faculty of Education, University of British 
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It is doubtful that refinements in treatment conditions and outcome measures 
alone could resolve these difficulties. Indeed, since this paradigm of research 
directs attention to treatment conditions and outcomes, to the neglect of the 
complex experience in between, it could hinder resolutions. That is, after receiving 
factual postquestions, what exactly are people supposed to do that improves 
learning? How people react to factual postquestions is, at best, inferential as long 
as researchers attend only to treatment conditions and outcome measures. 


When exposed to factual postquestions, some subjects might give up in 
frustration, others might become emotional or unsettled, and others might ignore 
the questions, going through the motions just to finish the experiment. To combat 
emotionality, some people might psychologically withdraw, downplay the 
importance of the task, or decrease effort. Others might rise to the challenge and 
increase effort, try to anticipate questions, or highlight facts rather than concepts, 
and so on. Although the treatment intervention looks simple enough, it is a very 
complex event in the ways it could influence people. For this reason, there is 
unlikely to be just one answer to why factual postquestions facilitate learning. 


In a recent review, Anderson and Biddle (1975) conclude that we do not need 
more demonstrations of effectiveness, but demonstrations of why factual 
postquestions work. Yet, given the potential complexity of subjects’ experiences 
and the prevalent method of research, it is difficult to imagine a happy outcome 
for this project. Unless researchers can develop ways to observe (using this term 
loosely) learning processes rather than depending solely on inference, what seems 
more likely is that adjunct questioning will generate a flood of research and 
gradually fade into neglect (a well-trodden path in psychological research). 


What seems to be needed is a method that would allow a researcher to specify 
exactly what is supposed to happen in the way people study and to determine if it 
happened. In this way, a researcher could test various explanations of 
effectiveness, show why one experiment failed to replicate another, and perhaps 
begin to chart the various ramifications of the test-like event. The general role of 
a monitor, then, is to provide a way to “get closer” to the learning processes or 
activities involved in altering learning products. 


There are two major problems that a method of monitoring must solve. First, 
learning activities (e.g., cognitive operations), effort, and emotion are not 
observable or, at least, not observable enough. To make the implicit explicit, one 
might use various forms of note-taking such as directed graphs (Pask, 1976), 
conversational methodologies (Thomas & Harri-Augstain, 1975), and probes 
(Britton, Piha, Davis, & Wehausen, 1978), among other things. In the present 
study, activities and experiences were monitored by self-report. The primary 
limiting factor is that a person can only report what he or she is capable of being 
aware of. 


Second, monitors should either not disrupt or minimally disrupt the learning 
experience. In this experiment, the experience was not disrupted at all, as subjects 
relied on cued memory to report their experiences. Suppose the task were to read 
a chapter~composed of ten sections. If a person made a self-report on supplied 
dimensions after each section, it could bias the experience. (Again, it might not. 
This has to be tested). This procedure is certainly close to the experience, but also 
is conceivably an influence on that experience. To remove the possibility of bias, 
subjects could read all sections. Then, afterward, by using each section as a cue, 
subjects could make a self-report upon what happened. This procedure is less 
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close to the experience, but incapable of influencing it. Its feasibility depends 


upon the accuracy of a person’s cued memory for events that took place shortly 
before. 


In the present experiment, college students studied ten sections of a chapter, 
rated those sections on ten study variables, and then completed two tests covering 
the chapter. The key questions involved at this point are: Are the self-report 
scales reliable? Are the self-report measures predictive of learning? 


There are also five different groups: a control group, a conceptual prequestion 
group, a conceptual postquestion group, a factual postquestion group, and a group 
that was shown a method for organizing comparative information. These 
treatment conditions were included not so much to show differences in learning 
products, but to explore differences in processes. In short, are the self-report 
variables sensitive to treatment conditions? The key questions here are: Does 
treatment differentially affect the degree of learning activities? Does treatment 
affect the way in which these activities are related? Does treatment affect the 
strength of relations among activities? Does treatment affect the predictive 
validity of study variables? 


Method 


Subjects 

Sixty-six students at the University of Victoria were recruited by poster. They 
were randomly assigned to the five treatment conditions, resulting in 14 subjects 
in the organizer group and 13 subjects in each of the other groups. The 
experiment lasted from 2-1/2 to 3 hours, for which each person was paid $9.00. 


Materials 

Textbook chapter. The first chapter of Aubrey Manning’s (1979) An 
Introduction to Animal Behaviour was used as the material to be learned. The 
chapter was divided into 10 sections, which ranged from about 340 words to 590 
words in length. In total, the chapter is approximately 4600 words long. The 
chapter was altered slightly in two ways. First, all diagrams and pictures were 
deleted and the prose altered accordingly. Second, to lengthen the introduction, 
paragraphs from the conclusion were extracted and placed in the first section. 

In this introductory chapter, Manning compared reflexes and complex 
behaviour on six properties: latency, after-discharge, summation, warm-up, fatigue, 
and inhibition. While the chapter is straightforward, it is also filled with details, 
examples, and new terms that could sidetrack injudicious readers and overwhelm 
others. 

Self-report scales. The 10 bipolar study dimensions were: 

1. studied topic just as a required reading/studied topic more because it became 
interesting, 

2. tried to grasp the central point, referring back and forth to make ideas fit 
into a whole/took information as it came and tried to remember it, 

3. felt focused, attuned to the discussion/felt scattered, had difficulty following 
discussion, ; 

4. accepted statements without questioning them/questioned validity and 
generality of statements, 

5. became emotional (distracted by irritation, frustration, pressure, and so 
on)/stayed calm and in control, 
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6. looked for principles or causes behind examples and details of 
behaviour/noted causes or principles, if presented, but did not search, 

7. formed mental pictures from information provided/did not form mental 
pictures, 

8. studied ideas in themselves without application to anything/thought about 
usefulness or significance of ideas, 

9. studied more leisurely/studied more intensely, and 

10. my understanding was due to my own efforts, what I made to happen/my 
understanding was due to the author’s presentation, level of vocabulary, 
mood, and other factors beyond my control. 


In this order, the study dimensions were aligned on a rating sheet, with five 
dots separating the poles of each dimension. The dots were labelled: definitely, to 
some extent, neither or in between, to some extent, and definitely. Subjects 
received 10 rating sheets, one for each section. They began with section one, 
skimmed the contents to cue their memory, and then, circled the dot for each 
dimension which best described how they studied that section. 


Organization Test. After reading the chapter and rating each section on 
aspects of studying, subjects completed a test of organization and then a test of 
recall. In the organization test, subjects were given an envelope containing small 
slips of paper, numbered from 1 to 33. On each slip of paper was written an idea 
from the chapter. Ten slips contained the names of different properties of reflexes 
and behaviour: latency, fatigue, inhibition, motor recruitment, antagonistic 
induction, after-discharge, summation, warm-up, response specific, and stimulus 
specific. The latter two designate types of fatigue. Fourteen slips contained 
examples of these properties such as: frog’s leg-wiping reflex, waning of chaffinch 
mobbing response, courtship of male Drosophila, mating of Praying Mantis, and 
the like. Nine slips contained more abstract phrases concerned with causal 
mechanisms and the nature of properties. For example, resistance to transmission 
in internuncial neurons designates a cause of fatigue, while integration of stimuli 
is a defining characteristic of summation. 


Subjects were told to think of an order in which to present these 33 items to 
someone else who knew nothing about the chapter. This ordering was 
accomplished in three steps. First, subjects divided items into groups that seemed 
to go together. Second, they ordered the groups in the way they would present 
them to someone else. Third, they ordered the items in each group. A discard pile 
was allowed for items they could not place. When completed, they recorded the 
ordering on an answer sheet with a column of numbers from 1 to 33. 


Recall test. The recall test was made up of 28 questions. Eleven questions 
asked for a definition of major terms; for example, what is reflex rebound, 
inhibition is , and the like. Ten questions required subjects to fill in the 
blank with the correct detail, cause, or explanation; for example, after-discharge 
can be lengthened and strengthened by -— ? Last, seven questions asked for 
detailed knowledge of examples; for example, when presented with an owl, what 
do chaffinch do? What are the three steps in the reaction of a newly-hatched 
cuttle-fish to a tiny shrimp? 

Treatment Conditions 


Control group. This group simply studied the chapter. 


Conceptual prequestion group. Before sections 3, 5, 6, 8, and 9, questions were 
inserted on a separate page. In writing at the top of the page, they were instructed 
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to use the following question as a study guide for the next section. The questions 
included the following: Compare aspects of latency in reflexes and complex 
behaviour. In what way or ways is summation in reflexes similar to summation in 
complex behaviour? Are these differences or potential differences? Is warm-up in 
reflexes the same as warm-up in complex behaviour? Or is there a difference? 
Contrast stimulus-specific fatigue and response-specific fatigue. Explain how the 
notion of inhibition in simple reflexes is extended to complex behaviour. 

Conceptual postquestion group. After sections 3, 5, 6, 8 and 9, the same 
questions were inserted. Written instructions on the inserted pages stated that 
they were not to look back nor take more than a minute to answer the question. 
To reinforce this instruction, they were informed in underlined type not to write 
out a full answer, but to list a few of the points they would cover if they had to 
write a full essay. They could return to any section after they had read the 
chapter all the way through. 


Factual postquestion group. After sections 3, 5, 6, 8, and 9, a series of detailed 
factual questions were inserted on separate pages. Subjects were instructed not to 
look back (until they had read the entire chapter) and to simply fill in the blanks. 
There were about four to five blanks to fill in for each section. The questions were 
factual rather than conceptual, involving memory for detail rather than 
organization of information. For example, two of the sentences for Section 3 were 
the following: When a tiny shrimp is presented to a newly-hatched cuttle-fish, 
there is no response for as long as ___ . The latency of reflexes is known to be 
due to delay in the transmission of ACTOSS tia 5 os 


Methodological organizer group. This group did not receive questions, but 
were provided with a framework for taking notes. Subjects were informed that the 
chapter used six properties of reflexes to show how reflexes and behaviour were 
similar. To make their task easier, presumably, they were presented with a scheme 
for organizing ideas in a comparative essay. First, there must be bases for 
comparison (e.g., the six properties). Second, there will be information about the 
subjects of comparison. Third, there is a result for the comparison, a judgment of 
the extent to which two or more things are similar or different. Fourth, there must 
be some sense of significance in the judgment, which can be derived from the 
author’s purpose. These four points were briefly explained and _ illustrated. 
Subjects were then provided with a sheet of paper with five columns. Above the 
columns were written the organizers: bases for comparison, information about 
reflexes, information about behaviour, result of comparison, and significance of 
comparison. Subjects were told that they could use this sheet to take notes or to 
just organize their thoughts as they read. 


Procedure. 

Students were tested in groups ranging from 7 to 20. The organizer group was 
tested separately, as they received unique instructions. The remaining groups were 
tested together. They were informed of the purpose of the study (i.e., to study 
how people learn from textbooks) and oriented to the tasks they would have to 
complete. 

The chapter was handed out and subjects were informed that some would have 
questions inserted. They were instructed just to follow the direction and that if 
they ran across questions, they were intended as study aids. Students were then 
given 35 minutes to learn the chapter. Immediately after studying the chapter, 
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they rated each section on the 10 study dimensions and then completed the 
organization test. After a break, they completed the recall test. 


Measures 

Study dimensions. The circled dots or ratings were transformed into numbers, 
ranging from five to one. 

Organization test. Organization scores ranged from 0 to 33. An item was 
counted if it was part of a well-formed group. A well-formed group was composed 
of a superordinate concept (e.g., latency, after-discharge, etc.) and subordinate 
examples, causes, and characteristics. Occasionally judgment was required, but, 
typically, students simply divided the items into six groups corresponding to the 
six major properties of reflexes. Consequently, in the future, it would be advisable 
to simply provide the superordinates in order to avoid having to make any 
judgments. However, in the present study, it was thought desirable to give 
students complete freedom in forming groups. If an item was correctly categorized, 
it was counted; if it was incorrectly categorized, it was not counted. 

The intent of this measure is to assess the extent to which students could 
organize information. Students often seem to study in a way that isolates 
information and requires only memorization. While memory is involved in this 
test, it also requires knowledge of the way information was structured. 

Recall test. Recall scores could vary from 1 to 28. If they answered a question 
correctly, it was counted. If they did not, it was not counted. Occasionally, 
judgments had to be made on these answers as well. 


Results 


The first question is: Are the self-report measures reliable? For each 
dimension, there were 10 ratings (one for each section). For each subject, the odd 
and even ratings were summed separately. A Spearman-Brown reliability 
coefficient (2r/1 + r) was calculated for each group. As can be seen from Table 1, 
these reliabilities tended to be high. 


TABLE 1 


ODD-EVEN RELIABILITIES OF STUDY VARIABLES FOR EACH GROUP 


Group Study Variables 

al 2 3 4 5 6 7 8 9 10 
Factual 
Poatquestion 75 86 61 93 91 onl 94 63 W7. 85 
Conceptual 
Peequestien syik O77, 90 97 . 96 97 ou Chit 78 98 
Control oi mor JTS -96 oS} O38! ee. ole 48 557) 
Conceptual 
Postquestion o)it 3} 94 85 noo -90 a0 94 5 8)5) -90 
Organizer no . 86 109 120 .89 noU -90 ie -90 oil 
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It might be noted that the reliabilities were not high due to any uniformity in 
ratings. Some subjects tended to rate each section in more or less the same way on 
some or most dimensions. However, most students showed medium to considerable 
variability. 

The second question is: Do these dimensions have predictive validity? As can 
be seen from Table 2, there were many significant correlations with organization 
scores and some with recall scores. With increased N, many more would have 
reached significance, as suggested by the overall correlations. Second, the 
correlations indicate variability across groups. Some treatment conditions seemed 
to highlight the importance of one variable for learning while other conditions 
highlighted another variable. For example, the factual postquestion group shows a 
strong, counter-intuitive correlation between imagery and organization scores. The 
more students used mental images, the worse they did. The less they used mental 
imagery, the better they did. One plausible explanation is that imagery which 
facilitates learning is interactive, combining diverse aspects into a unifying picture. 


TABLE 2 


CORRELATIONS OF STUDY VARIABLES WITH ORGANIZATION 
AND RECALL SCORES BY GROUP 


Study Variables 


oF 1 2 3 4 5 6 7 8 9 10 
Organization Scores 
Eaetual Pil 163 = 2h .18 =e 126 -.80** .10 .30 18 
Postquestion 
Beveerere> 19 52 18 ~.59% 02 .09 42 =.04 ~.67%* 55% 
Prequestion 
Control -.38 49% 666% = 48% -.62% 5 45 -.49% 32 ~.09 
epecr runs -.46 39 51% aes Mey 134 29 ay ~.55% 17 
Postquestion 
Organizer -.19 51% 55% £19 739 =. (0H .08 = 023 =,18 ul 
All Combined -.22% 44k .28%% =. 16 =. i169) .16 .10 = 18 Sapikes 2% 
Recall Scores 

acest af3 ol 6 36 E01 OF. 036 26 11 ~,05 
Postquestion 
Conceptual ~.25 32 45 ~.67%* SSaiitay 23 ails -.26 226 57% 
Prequestion 
Control =A35 56% £45 59) —.52* .32 .33 -. 38 .19 -.10 
poncept iam -.18 16 45 =;19 ay) a5” </95 -.30 -.05 Sah 
Postquestion 
Organizer 5510) .39 .30 .06 34 .00 41 -.44 -.19 -.14 
All Combined -.24% 31% 23% = itl) ~.25* sy .08 S27 09 sil) 

* p<.05 

*k p<.01 


Perhaps, the factual questions disposed students to memorize in an isolated rather 
than an integrative fashion, as they would have been encouraged to read for detail 
rather than structure. Correct or not, the strength of this correlation and its 
strong divergence from the same correlations for the control and prequestion 
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groups requires explanation. Perhaps the most fruitful view of these correlations is 
that treatment condition appears to affect the predictive validity of study 
dimensions. A third aspect of Table 1 is that the decreasing strength of 
correlations across groups suggests that treatment condition might disrupt or 
weaken the validities of dimensions. The control group had the strongest 
correlations and the factual group the weakest. It appears that the more intrusive 
an intervention is, the less predictive are these study variables. Perhaps different 
variables, however, would present another story. 


In summary, the self-report variables were reliable and, to varying degrees, 
predictive. This method of monitoring, consequently, receives support from two 
important types of evidence. 


In Table 3, the means and standard deviations of the ten study variables and 
the two tests are reported. An ANOVA was conducted upon each variable and test 
separately. Although some F values approached significance, none reached it. This, 
of course, does not mean that there were no differences. Had one predicted 
specific differences, some tests would have been significant. For example, one 
might have predicted that factual postquestions would dispose subjects to study in 
a more detailed and isolating fashion, to memorize rather than synthesize. On 
variable 2, a t test between the control group and the factual group would have 
been significant, however illegitimate the test is now, ¢t (24)=2.54, p<.02, 
two-tailed test. The sensitivity of the monitoring variables could probably have 
been increased by phasing the variables more concretely and specifically or by 
making sure they were appropriate for the treatment conditions. On the other 
hand, this might be the wrong type of data for impressive differences. That is, the 
treatment conditions might have presented a situation that polarized subjects on 
the variables presented. 


TABLE 38 


MEANS AND STANDARD DEVIATIONS 
OF STUDY VARIABLES AND TEXT SCORES 


Groups Study Dimensions Tests 
1 2 3 4 5 6 7 8 9 10 Organs Recall 
zation 

Control 

Mean PRS Sd 37.6 eH) 36.5 Pie) =i eRS) Ad. 25.8 27.9 30.9 Lies 12.5 

SD Ue) 8.2 8.0 9.4 O28 Clee! Thales G2) 4.6 ies 6.2 6% 3 
Prequestions 

Mean 30.9 Bea 33.9 35:69 PUY | 25RG 355.5 31.5 26.5 29135 10.9 10.1 

SD 10.6 £20 8.9 L220 8 12.2 10.2 10.4 6.8 Ohi 5 Died 
Postquestions 

Mean eieine! 29.4 28.5 38.6 28.8 26.8 33.8 PAS TT) Zo 29.6 iuSyS7/ £362 

SD a 11.4 LOn2 8.4 Wie Ley 10.9 8.7 Sry, by) 4.7 5.0 
Factual Questions 

Mean 32.9 29.6 HNC) 37.0 229 Ze 36.4 B2..7 30.5 29.8 nl) Uior: 

SD 7.8 7.9 Use) 8.2 110 8.4 9.4 8. 6.4 6.4 5.8 4.1 
Organizer 

Mean 29.03 36 .6 34.6 Soi. a 26.4 Pay ae} 36.4 34.2. 232 33.4 15.4 L246 

SD TYEE) a7 8.5 A ERS 8.3 9.6 9.6 10.0 TZ lie? 6.6 Dad 
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TABLE 4 


CORRELATIONS AMONG STUDY VARIABLES 


Variable Groups 

Pairs 

, Pre= Post- All 

Factual Question Control Question Organizer Combined 

1-2 -.53%* -.29 aero —69%%* -.32 —.46%*% 
1-3 -.42 -.37 = eS) —.90** -.//%* —.60** 
1-4 «1 2%* -.12 307 . 17 %* . /2%* -42%% 
1-5 oy we 51% 49% - 58% 48% 
1-6 —.64%% -.13 S525 —. 85%* -.34 —.44%% 
1-7 SGU) -.28 = oll@ -.63%* -.55% —.40%*% 
1-8 58% 46 52* . 15** 42 50** 
1-9 By 36 = 03 47 25 35%** 
1-10 —.81%*%* - .06 44 -.0O1 -.02 AOR) 
2-3 52% 75%* 23 . 86%%* 51* 62%* 
2-4 -.50%* -.36 14 -.54% -.24 —.29%* 
2-5 —.54% -.43 = gffishe -. 38 -.57%* —.45%x* 
2-6 87** 71%% 41 -68%% 25 59** 
2-7 Salts 53* 63% .65%*% 36 47k 
2-8 =.8* -.43 =n Ee -.55* -.32 —,43%%* 
2-9 =, 310) —.53% HOA —. 48% -.04 >. 39%% 
2-10 77%% 38 S602 -. 30 50 29%% 
3-4 Soh -.22 30 -. /1** -.38 —.40%% 
3=5 Sor —.70** = hy -.56%* =.61% —, 58%* 
3-6 56% 81** 13 . 19*% 34 53%%* 
S=/] 62% 73% 22 .66%** 20 49x*x 
3-8 16 (sks — .82%%* — .66%* —.6/7*% -.29 aol xx 
3-9 =, 7) -.29 40 -.44 -.1/7 —,28%% 
3-10 42 26 Sols) -.23 28 14 
4-5 LS =. 10 = 405 52) 54% 17 
4-6 ges 25 oi 05 —. 83%%* —. /2*%* —.47%* 
4-7 =o ita -.13 34 sti fs -.47% Sranaiaes 
4-8 62% 09 = eS) . /8** 33 30** 
4-9 34 04 ues) Bal 41 Ny 
4-10 —,68%%* Sant 305 noo -.05 —,26% 
5-6 471% 103 07 =n She! —. 36 —, 33%* 
5-7 ly —.66%*% eS 02 -.40 —,32%* 
5-8 67** 68%* ay? 30 38 44kx 
5-9 = 6.748) 10 (5) 07 39 Ol 
5-10 = pale 04 4210) 00 -.06 =e O2 
6a7 =, Le) 59% ang 54% 34 39%%* 
Aas — .74x* —,69%* a5 5* —.74%% —.48% —. 64%% 
6-9 ee ul Soil ~.46 SES) =. 30 —. 27% 
6-10 84k* 34 ar) 5513) 12 WS) 
1=8 6 Als) —.74%% A Sype —.65** —.52% —.54%* 
7-9 =o Ailes =o Si ollie) 5 25) O1 -.20 
7-10 = (0/8) Ae R02 SH Zo 05 
8-9 07 16 OH, Zo Le 10 
8-10 —. /4%* =e Ure - 40 ag IES) O1 
9-10 =As = 40 more = (07 Pai -.21* 


Table 4 records the correlations between variables for each group individually 
and for the groups as a whole. The question of interest, in perusing this table, is 
whether or not treatments influenced the way variables were related or put 
together to compose the varying experiences expected from different treatment 
conditions. Correlations can differ either in pattern or intensity. For example, the 
relation between interest (variable 1) and locus of control (variable 10) for the 
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factual group is different in kind or pattern. Correlations for other groups were 
either negligible or positive. Thus, for the factual group only, there is a strong 
relation between degree of interest and locus of control. That is, the more 
interested a subject was in the material, the more he or she attributed 
understanding to his/her own efforts. Consequently, in future studies, locus of 
control might prove to be an important variable in determining how subjects will 
react to factual postquestions. A difference in intensity by contrast, is based upon 
strength of correlations. For example, disregarding sign, we might consider only 
the strength of the correlations involving variable 10. Upon inspection, it is 
apparent that, for the factual group only, locus of control was strongly coordinated 
with other variables in composing the total experience. It tended to be rather 
negligible or circumscribed in other groups. What this might mean is that the 
factual condition highlights locus of control as a more central attribute of the 
experiential condition. That is, assuming an internal or external locus of control 
would have many more implications for positions assumed on other variables. 
While there are a number of interesting similarities and differences in this table, it 
would be tedious to elaborate further. However, the overall patterns and 
intensities of group correlations can be compared more directly. 


To compare the patterns of correlations, the columns in Table 4 were 
correlated. First, however, the negative signs were minimized; that is, by reversing 
the poles of variables 1, 4, 5, 8, and 9, the majority of correlations became positive. 
This was done to avoid artifactually inflating the resulting correlation. Second, 
correlations were squared and multiplied by 100 to make them additive, and 
consequently, suitable as scores. Signs were preserved in this operation. Table 5 
records the correlations among the sets of group correlations. The factual group 
tended to be isolated, having little in common with the other conditions. The 
prequestion group showed considerable similarity in correlational pattern with the 
control group. Both related more weakly to the conceptual postquestion group 
which, in turn, related more strongly to the organizer group. The least intrusive 
conditions resulted in a similar patterning of variables, the more intrusive 
conditions resulted in a similar patterning, and the factual emphasis resulted in a 
distinctive patterning. Thus it appears that treatment conditions were capable of 
altering the way variables were related. 


TABLE 5 


CORRELATIONS BETWEEN GROUP SETS OF CORRELATIONS 
AMONG STUDY VARIABLES 


Groups 
Groups 1 2 3 4 5 
1. Factual postquestion Xx 5 dU) -.04 .07 -.05 
2. Conceptual prequestion X 194 72D -.01 
3. Control X B59 oS) 
4. Conceptual Postquestion X sist 
5. Organizer xX 
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To compare the strength of correlations, signs were disregarded. Table 6 
records the median and quartiles for each set or correlations. As can be seen, the 
factual and postquestion groups manifested generally strong correlations relative 
to the organizer and control groups, with the prequestion group in the middle. For 
another perspective, the means and standard deviations for each set of 
correlations (squared and multiplied by 100) were: Conceptual postquestion, M 
=29.70, SD=23.59; factual, M=26.18, SD=20.84; prequestion, M=21.09, 
SD=20.75; organizer, M=15.69, SD=14.44; control, M=11.40, SD=11.35. 
Treatment conditions appear to have had a substantial effect upon the extent to 
which study variables were related. That is, the conceptual postquestion group 
studied in a tight, well-defined pattern, with nearly every variable operating in 
coordination with many others. By contrast, the control group studied in a looser, 
more ill-defined pattern. Another way of saying this is that members of the 
conceptual group showed ordered variation. The treatment condition tended to 
impose a pattern on the experience. Members of the control group showed more 
distinctiveness. There was much less pattern among members. What this suggests 
is that treatment varies in the extent to which it requires a uniform pattern of 
studying. 

Potentially, this contextualistic approach to treatment conditions is very 
important,as the evidence suggests that different treatments lead subjects to cue 
on different variables and to cue in divergent ways. For example, the factual group 
seemed to cue on locus control, which may in turn, have cued other variables. As 
another example, there was a suggestion (i.e., weak correlations) that the control 
group studied more leisurely when focused, attuned to the discussion, while the 
conceptual postquestion group studied more intensely when focused. The pattern 
and strength of interrelationships has implications for the quality of the learning 
experience and the quality and quantity of learning products. 


TABLE 6 


MEDIAN AND QUARTILE CORRELATIONS FOR EACH 
SET OF CORRELATIONS 


ee eee 


Group Ql Median Q3 

Postquestion 230 -49 -68 

Factual All 52K) Foz 

Prequestion 14 see 229 

Organizer 5745) . 34 -48 

Control ll} 5 By ~44 
Discussion 


This study is concerned with the potential usefulness of a method for 
monitoring learning experiences and activities in studying. To this end, the 
evidence demonstrates, first, that people judged their own activities and 
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experiences in a reliable fashion. Second, these self-reports were predictive of 
learning, particularly learning involving the organization of information rather 
than just recall of information. Predictive validity was particularly strong for the 
control group. Five of ten correlations with organizational learning were 
significant, and some others could probably be made significant with- higher 
number of subjects. This evidence supports the assumption that self-reports 
provide a reliable monitor of variables that are important for learning. 


A counter hypothesis might be that people have a reasonable idea of the 
amount of information they have learned. On this basis they might infer what 
they must have done, since at least some of the variables are clearly positive 
versus negative. However, this hypothesis does not take into account the 
variability of individual judgments. Students did not, in fact, rate sections in the 
same way. They varied, and many varied considerably, presumably in response to 
treatment, the material, and other factors. 


Study variables also appeared to be highly sensitive to treatment condition. 
Predictive validities varied according to treatment. The pattern and strength of 
correlations among study variables also varied according to treatments. Perhaps 
this is not sensitivity, but merely random fluctuation. For example, the average 
variance in common among correlations ranged from 11.40 for the control group to 
29.40 for the conceptual postquestion group, but this is not the sort of fluctuation 
one would expect from random samples. It constitutes a very large difference, 
indeed. 


Probably the fairest judgment of these data is that they support the possibility 
of using this type of monitor more widely in studies of textbook learning. The 
method cannot be adequately assessed in but a single experiment, but rather will 
require converging evidence from a number of experiments. To this end, the data 
from this beginning investigation are encouraging and interesting. 


From the hindsight of experience, however, a few pieces of advice might be 
appropriate. Among other things, a researcher might use this monitor to test 
hypotheses or to check to see if a treatment produced the effects that were 
thought to produce better learning. In this context, it would be advisable to use 
variables that are phrased as concretely as possible. It would probably be even 
better to break an abstract variable into two or more very concrete operations and 
sum these together. For example, variable 3 (felt focused, attuned to the 
discussion) could have been operationalized one way as the extent to which a 
person was aware of other people and things while reading. Another 
operationalization might involve the extent to which a person could or did follow 
the author’s points without having to review, putting effort into it, reaching an 
impasse, becoming confused, and the like. Second, a researcher would naturally 
want to select variables that tested a theory or were appropriate and important for 
distinguishing among treatments. (Incidentally, this was not a concern in this 
experiment. The variables were selected because of their general prominence in 
educational research). However, as a check upon target variables, it would be 
desirable to have variables that are irrelevant to treatment, not apt to be affected 
by varying conditions. This may not even be possible, but would be worth a try. 
Last, the wisdom of using bipolar variables may be questioned, in some 
experiments, because bipolarity might establish a positive/negative bias, or 
prejudge and restrict the variable under investigation. Bipolarity helps to clarify 
exactly what is being asked, but perhaps at too high a price for some types of 
research. 
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In summary, the self-report method appears to be a reasonable way to monitor 
what subjects are doing and undergoing when they are studying. In making 
explicit unobservable happenings, the method offers a way to test hypotheses, 
check bases for treatment effectiveness, provide a richer picture of learning or 
what is happening between the beginning and the end of experiments, and 
perhaps develop a phenomenology of learning experiences. 


This study was supported by an Academic Development Fund at the University of Victoria. 
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A Study of the Relationship Between 
The Job Satisfaction of Principals and 
The Perceived Level of Teacher Militancy 


Building principals are the incumbents of leadership positions in_ the 
educational organization and are vital to the success of their clients. This study 
assessed whether there were significant differences in the level of job 
satisfaction, autonomy, esteem, and self-actualization experienced by principals 
who perceived a high level of teacher militancy and those who perceived a low 
level of teacher militancy in their respective schools. Differences in the level of 
teacher militancy perceived by principals in urban, rural, and suburban schools 
were also investigated. In addition, the study explored whether principals 
perceiving either a high or a low level of teacher militancy had significant 
differences in their preference for a type of organization to represent their 
rights and interests. Statistical tests utilized included the analysis of variance, 
chi square, and a factor analysis appropriate to the evaluation of the 
hypotheses. 


Research findings suggest that the increasing militancy of teachers and the 
growing strength of teacher organizations have diluted the authority of the 
building principals and eroded their managerial powers (Wildman & Perry, 1966; 
Barea, 1977; Kline & Thomas, 1974; Wildman, 1964). The major objective of this 
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study was to examine how the perceived level of teacher militancy relates to the 
level of job satisfaction of the building principal. Job satisfaction is defined as the 
measure of one’s need fulfillment in the workplace. The influence of mediating 
factors, including grade level, and other organizational variables and demographic 
characteristics, was studied. Whether teachers and principals both sought to fulfill 
their higher order needs (social, esteem, and self-actualization) was also assessed. 
Thirdly, data regarding the desire of building principals to have their rights and 
interests represented by various types of organizations or through a management 
team were also investigated. 


Pertinent Research Literature 


Getzels and Guba suggested that conflicts related to the fulfillment of a role 
may arise in three ways: as a result of disagreement within the referent group 
defining the role; disagreement among several referent groups with each having a 
right to define expectations for the same role; and contradictions in the 
expectations of two or more roles which an individual may occupy at the same 
time (Getzels & Guba, 1957). Recognizing the importance of the building principal 
in the school’s formal hierarchical organization, it is essential that the incumbent 
of this position assume a leadership role. Leadership as described by Lipham 
involves “the initiation of a new structure or procedure for accomplishing an 
organization’s goals and objectives” (Lipham, 1964, p. 22). 


The expansion of public employee unionism has been a rapid and extensive 
post-World War II phenomenon. The emergence of public employee unionism has 
been attributed to a variety of factors such as supportive legislation, union 
competition for membership, rampant inflation, increased demand for government 
services, and the publicity given to union gains and success in the private sector 
(Cohany & Dewey, 1972). More specifically, teacher militancy, that is the 
aggressiveness of teachers to bargain for their collective good, has been attributed 
to the drive for professionalization and autonomy culminating in a greater voice in 
school decision-making; the desire for more money and benefits; a defensive 
reaction to widespread criticism and disenchantment with massive student 
underachievement; the NEA/AFT rivalry; the increasing percentage of males in 
the teaching force; and the era of political activism in which collective action, 
demonstrations, and thrusts for power have been both prevalent and effective 
(Perry & Wildman, 1970). Furthermore, research by Giandomenico (1973) found 
that teachers’ inability to obtain a desired amount of neec satisfaction from their 
jobs also served as a catalytic force for aggressive collective action. Interestingly, 
the teachers who exhibited the greatest degree of militancy were less concerned 
with financial gains than with autonomy, esteem, and self-actualization. 

It has been suggested that as a result of teacher militancy and unionization, 
building principals have experienced a loss of their administrative freedom to 
exercise judgment since the scope of teacher negotiations now includes such 
traditional areas of administrative discretion as class size, promotions, teacher 
assignments, transfers, nonteaching duties, and the length of the school day. 
General agreement exists among administrators that boards of education and 
teacher organizations, through collective agreements, have provided teachers with 
increased decision-making powers (Yeakey & Johnston, 1979) and some building 
principals apparently agree that their discretionary decision-making power to 
manage, operate, and direct has been severely eroded (Barea, 1977). Related 
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research by Butkiewicz (1973) assessed the impact of collective negotiations on 
secondary school principals and concluded that the functional role which was 
altered most by professional negotiations was the personnel management function. 
Butkiewicz noted that negotiated agreements had, in fact, diminished the number 
of unilateral decisions a principal could make in relation to policy development at 
the school level. Also, principals were forced to rely on input from a variety of 
sources before reaching a decision. 

Operationally speaking, the initial impact of the negotiated agreement falls 
most heavily on the building principal, for it is at the building level that the 
primary responsibility for managing and adapting the school organization to the 
new power alignment resides. Research on factors affecting the development of 
worker participation in management has shown that managers resent having their 
decision-making power diluted by joint consultation or shared decision-making 
(Derber, 1970). On the other hand, however, teacher organization advocates 
counter that the militancy of teachers represents a search for professional control 
over their own areas of expertise. 

Within the educational workplace, both teachers and principals seek to fulfill 
their needs and it is the gratification of those needs which result in an individual 
feeling positive about self. The traditional theory guiding research investigations 
on job satisfaction has been that “differences in job satisfaction reflect differences 
in the nature of the jobs or work situations of individuals” (Vroom, 1964, p. 172). 
Research on variables that affect the building principal’s job satisfaction has 
demonstrated that the greater the principal’s autonomy and role specificity, the 
greater his or her job satisfaction (Gross & Napior, 1967). However, few studies 
thus far have dealt with variables that have become highly visible with the 
evolvement of collective bargaining for public school teachers. 


Maslow proposed that the chief principle of organization in human 
motivational life is the arrangement of basic needs in an hierarchy of potency 
(Maslow, 1970). He postulated that as individuals satisfy their lowest level of need 
satisfaction, they are driven to a new level of satisfaction. Maslow assumed that 
human needs evolve in a predictable manner beginning with the lower order 
physiological and safety needs and progressing to the higher order social, esteem, 
and self-actualization needs. Physiological needs include homeostasis, hunger, sex, 
and thirst, while safety needs include security, stability, and freedom from fear 
and chaos. Social needs encompass love, affection, and belongingness, whereas 
esteem needs include the desire for achievement, independence, status, and 
recognition. Self-actualization encompasses needs- of _ self-fulfillment, 
self-enchancement, and accomplishment as well as a desire to become all that one 
is capable of being. 

Omitting the physiological needs, Porter adapted this hierarchy for the study 
of needs-satisfaction among management personnel. Porter’s needs hierarchy 
included security, social, esteem, autonomy, and self-actualization (Porter, 1961). 
The autonomy level includes authority, opportunity for independent thought and 
action, participation in goal setting, and participation in the determination of job 
methods and procedures. Porter concluded that, at various managerial levels, the 
largest frequencies of need fulfillment deficiency occurred in the higher order need 
areas of esteem, autonomy, and self-actualization (Porter, 1962). 


It is significant to note that although there is research analyzing individual 
need hierarchy and need satisfaction among management personnel, research, to 


304 


Job Satisfaction of Principals 


date, has not assessed variables that affect building principals’ job satisfaction 
since the evolvement of collective bargaining for public school teachers. In essence, 
there is little existing knowledge concerning the relationship between collective 
negotiations and the job satisfaction of building principals. Both Maslow and 
Porter assumed that a general need structure was universally distributed among 
the human populace. The fact that school principals, as middle management 
administrators, perceive a loss of their administrative prerogatives may impact 
negatively on the job satisfaction of these administrators. To assess the effect of 
teacher militancy on the building principal’s job satisfaction, the following 
hypotheses were formulated: 

1. There is no significant difference between the level of job satisfaction of 
principals who perceive a high level of teacher militancy in their buildings 
and principals who perceive a low level of teacher militancy in their 
buildings. 

Two sub-hypotheses were constructed to assess differences in principals’ job 
satisfaction as a function of their employment in a specified community type or 
instructional level. 
la. There are no significant differences among the levels of job satisfaction of 

principals in rural, urban, or suburban schools. 

lb. There are no significant differences among the levels of job satisfaction of 
principals of elementary, middle/junior, or high schools. 

2. There is no significant difference between the esteem score of principals who 
perceive a high level of teacher militancy in their buildings and principals 
who perceive a low level of teacher militancy in their buildings. 

3. There is no significant difference between the autonomy score of principals 
who perceive a high level of teacher militancy in their buildings and 
principals who perceive a low level of teacher militancy in their buildings. 

4. There is no significant difference between the self-actualization score of 
principals who perceive a high level of teacher militancy in their buildings 
and principals who perceive a low level of teacher militancy in their 
buildings. 

5. There is no significant difference in the preference of principals who perceive 
a high level of teacher militancy in their schools and principals who perceive 
a low level of teacher militancy in their schools with regard to having their 
interests and rights represented by a management team, an administrators’ 
association, or an administrators’ union. 

6. There is no significant difference between the levels of perceived teacher 
militancy among principals from urban, suburban and rural communities. 


Methodology 


A stratified random sample of 45 building principals was selected from three 
counties in the northeastern region of the United States. The three counties were 
selected because they geographically provided a cross-section of rural, suburban, 
and urban subjects pertinent to the study. The sample was stratified to ensure 
that 15 of the participants would be from each of three community type 
designations (rural, suburban, and urban). The sample was further stratified to 
ensure representation of building principals from elementary (K-5, K-6), 
middle/junior (6-9), and high school (9-12, 10-12) levels. Ninety-one percent of the 
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building principals were male with 9.5 average years of experience. The average 
years of service in their current position was 6.6 years. Principals’ rights and 
interests were represented through a variety of media including unions, 
professional administrators’ associations, and management teams. 


Instrumentation 


Three instruments, the Minnesota Satisfaction Questionnaire, the Attitudinal 
Militancy Scale, and a personal data form were used. The instruments were 
selected because of their high level of reliability, their content validity, and the 
efficiency with which they could be administered. 


The Minnesota Satisfaction Questionnaire (Weiss, Davis, England, Lofquist, 
1967), a 100-item self-administered instrument, was designed to measure 
satisfaction with specific aspects of work and work environments. The following 
choice among five responses was provided on a scale of 1 to 5 for each item: (1) 
Very dissatisfied; (2) Dissatisfied; (3) Neither satisfied nor dissatisfied; (4) 
Satisfied; and (5) Very satisfied. The alpha coefficient of reliability was computed 
to be 0.86. 


The Attitudinal Militancy Scale (Wohnsiedler, 1975) was used to measure the 
level of teacher militancy in those schools where collective bargaining had been 
implemented. The following choice of five responses was provided for each item on 
the scale: (1) Strongly agree; (2) Agree; (3) Undecided; (4) Disagree; and (5) 
Strongly disagree. A _ reliability coefficient of .93 was established for the 
instrument by use of the alpha coefficient. Directions for the completion of this 
scale by classroom teachers were modified for use by building principals in their 
assessment of the perceived level of teacher militancy in their schools. 


Statistical tests included the analysis of variance, chi square and a factor 
analysis appropriate to the evaluation of the hypotheses. The hypotheses’ testing 
required the dichotomization of the principals into those who perceived a low level 
of teacher militancy and those who perceived a high level of teacher militancy. 
These two groupings were established by using the mean teacher militancy score 
as cut-off point. A factor analysis of selected items from the Minnesota 
Satisfaction Questionnaire was conducted for the purpose of data reduction and to 
attempt identification of those items that would be related to the constructs 
esteem, autonomy, and self-actualization as defined by Porter. 


Findings and Analysis 


As Table 1 shows, the general job satisfaction scores of building principals who 
perceived a high level of teacher militancy and those who perceived a low level of 
teacher militancy were not found to differ in a statistically significant manner. 
Thus, the first hypothesis was supported insofar as the statistical analysis of the 
data demonstrated that the difference in mean scores could be due to chance 
alone. The failure to establish a significant difference between the general job 
satisfaction scores of the two principals’ groups may be explained by viewpoints 
and conclusions cited in the literature. Butkiewicz (1973) found that many 
secondary school principals saw negotiated agreements with teacher organizations 
as having a positive effect on key areas of instructional leadership including 
teacher supervision, teacher inservice, and curriculum improvements. Although the 
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principals felt that their jobs were more difficult because of their responsibility in 
implementing the terms of such agreements, Butkiewicz surmised that these 
added responsibilities may have increased the respect and prestige of the 
principal. Thus, although principals may have lost certain managerial prerogatives, 
they now enjoy the clarification of rights and responsibilities that have resulted 
from comprehensive written agreements and such clarification has served to 
strengthen the job satisfaction of the principal (Gross & Napior, 1967). 


TABLE 1 


SUMMARY DATA AND ANALYSIS OF VARIANCE FOR THE 
RELATIONSHIP BETWEEN GENERAL JOB SATISFACTION 
AND PERCEIVED TEACHER MILITANCY 


Group n M SD 
Low Militancy 24 (5403 6.93 
High Militancy Zi Viens © UB} iS) 

Degrees Sum 
of of Mean PF 

Source Freedom Squares Squares Ratto 
Between Groups al 26.86 26.86 0.36% 
Within Groups 43 3178.28 73.91 
Total 44 3205.14 


* not significant p<.05 


TABLE 2 


SUMMARY DATA AND ANALYSIS OF VARIANCE FOR THE 
RELATIONSHIP BETWEEN GENERAL JOB SATISFACTION 
AND COMMUNITY TYPE 


Group n M SD 
Urban 15 76.00 8.09 
Suburban WS 78.60 8.28 
Rural 5) 12607 OS i/ 

Degrees Sum 
of of Mean Hy 
Source Freedom Squares Squares Ratto 
ee ee ee e———— 

Between Groups 2 100.61 50.31 0.68% 
Within Groups 42 3104.53 Ps8? 
Total 44 3205.14 


A ge ee ee ee ee ee ee 


* not significant p<.05 


307 


G. S. Johnston, C. C. Yeakey and R. A. Winter 


In regard to the differences between the general job satisfaction scores of 
building principals from urban, suburban, and rural communities, hypothesis la 
was supported. However, as reflected in the summary data in Table 2, the 
difference in scores was found to be not statistically significant. It was anticipated 
that the principals’ levels of job satisfaction would improve as one progressed from 
the urban to the rural community type. Hypotheses 1b was also supported 
although the general job satisfaction scores of principals from the three specified 
levels did not differ significantly as shown in Table 3. The results of hypotheses 
la and 1b differ from Mifflin’s research (Mifflin, 1975) which demonstrated that 
urban school principals experienced a much lower level of job satisfaction than 
their suburban and rural counterparts. Similarly Mifflin’s finding that elementary 
school principals enjoyed a much higher level of job satisfaction than secondary 
school principals was not replicated by our study. Mifflin’s views, however, may 
reflect a regional bias and be inappropriate for generalization to areas outside of 
the sample selected. 


TABLE 3 


SUMMARY DATA AND ANALYSIS OF VARIANCE FOR THE 
RELATIONSHIP BETWEEN GENERAL JOB SATISFACTION 
AND INSTRUCTIONAL LEVEL 


Group n M SD 
Elementary 1S) UsyoO10) 8.10 
Middle/Junior Is 76.00 eee al 
High ‘Ie 78.67 ea 
Degrees Sum 
of of Mean BF 

Source Freedom Squares Squares Ratto 
Between Groups 2 MO 7exohil Sock On7 3% 
Within Groups 42 30917 39 O3% 75 
Total 44 3205.14 


* not significant p<.05 


It is important to note here that the view is often propagated that school 
professionals working in the urban school environment are disheartened, 
disillusioned and unfulfilled, that their professional needs are not being met under 
the weight of awesome social, political, educational, and economic impingements 
on-their work. On the contrary, our study has shown no significant difference in 
principals’ job satisfaction in either community type. What perhaps has evolved is 
a new type of urban school administrator who has the experiential background 
and unique expertise to work with a diversified populace. As research by 
Johnston, Yeakey and Holden (1979) has indicated, the turmoil of the sixties 
demanded and secured a new kind of administrator who, far from being 
overwhelmed by urban school dynamics, was creatively challenged by them. The 
resultant impact on job satisfaction is obvious. 
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The data in Table 4 illustrate that, on the average, the esteem factor scores of 
building principals who perceived a high level of teacher militancy in their 
buildings were not significantly different from the esteem factor scores of building 
principals who perceived low levels of teacher militancy. The second hypothesis 
was therefore supported. Several reasons might account for this outcome. Building 
principals may have found alternative means of fulfilling their esteem needs, 
irrespective of the level of teacher militancy in their schools, or the higher order 
esteem needs of principals may be either latent or unfulfilled because of the 
prepotency of lower order needs. Also, perceived teacher militancy might have 
little or no impact on the level of esteem experienced by the building principal 
since militant teachers and their organizations are, at least initially, concerned 
with salaries, benefits, and other lower order needs. Our finding might also be 
explained by the opinions expressed by a high percentage of principals who 
indicated support for teachers’ right to bargain collectively. Such favourable 
attitudes may reflect the principals’ desire for similar recognition or reflect a 
measure of empathy since all principals were former teachers. 


TABLE 4 
SUMMARY DATA AND ANALYSIS OF VARIANCE 


FOR THE RELATIONSHIP BETWEEN ESTEEM 
AND PERCEIVED TEACHER MILITANCY 


Group n M SD 
Low Militancy 24 34.83 Lela) 
High Militancy 21 36.14 6.24 
Degrees Sum 
of of Mean F 
Source Freedom Squares Squares Ratto 
Between Groups i S20 USN PAG) 0.65% 
Within Groups 43 1280.00 DS) ho 
Total 44 UW PASIS) M0) 


* not significant p<.05 


As shown in Table 5, the autonomy factor scores of the building principals who 
perceived a high level of teacher militancy were not significantly different from 
the autonomy factor scores of building principals who perceived lower levels of 
teacher militancy. The third hypothesis was therefore supported. These results 
encourage further investigation as to whether the building principals’ autonomy 
need is being fulfilled. Similar to our discussion of the second hypothesis finding, 
it might be that militant teachers and their organizations have been concerned 
primarily with lower order needs, salaries, fringe benefits, and have not yet 
infringed upon the managerial prerogatives of the building principals at the high 
order need level. Willower’s (1975) research of militant and non-militant districts 
with regard to teacher organization influence on decision-making is supportive of 


this position. 
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TABLE 5 


SUMMARY DATA AND ANALYSIS OF VARIANCE 
FOR THE RELATIONSHIP BETWEEN AUTONOMY 
AND PERCEIVED TEACHER MILITANCY 


Group n M SD 
Low Militancy 24 46.96 4.96 
High Militancy 2 46.38 Oesy2 
Degrees Sum 
of of Mean F 

Source Freedom Squares Squares Ratto 
Between Groups ii ofS} SaUo Oe as 
Within Groups 43 1413590 32588 
Total 44 1417.64 


* not significant p<.05 


Analysis of the data in Table 6 shows that the self-actualization factor scores 
for the two groups of building principals were not significantly different. The 
fourth hypothesis was also supported. This finding may reflect, as did our finding 
with the second and third hypotheses, that teachers and their organizations have 
not interfered with the approaches used by a building principal to fulfill his higher 
order self-actualization needs. Teacher unionism, still in an embryonic stage of 
growth in the schools sampled, may be primarily concerned with fulfilling the 
lower order needs of their membership. 


TABLE 6 


SUMMARY DATA AND ANALYSIS OF VARIANCE FOR THE 
RELATIONSHIP BETWEEN SELF-ACTUALIZATION 
AND PERCEIVED TEACHER MILITANCY 


Group n M SD 
Low Militancy 24 48.83 4.04 
High Militancy PAL 49.00 Ore) 

Degrees Sum 
of of Mean PF 

Source Freedom Square s Squares Ratto 
Between Groups iL OF 338 Ones 0.01* 
Within Groups 43 IOS) 5 S33} 41.14 
Total 44 1769.66 


ee ee a ee oT re weet ae eee See ee eee ee er Pee 
* not significant p<.05 


360 


Job Satisfaction of Principals 


While it is true that principals view the negotiated agreement as limiting their 
discretionary powers, they also feel that negotiated agreements increase their 
duties and responsibilities. Almo (1974) reported that 86 percent of the principals 
included in his study believed that collective bargaining was making their lives 
more difficult. However, Almo also found that principals saw themselves as 
capable individuals with the ability to find the appropriate means necessary to 
carry out their job responsibilities effectively. The inherent opportunity for growth 
and the challenge in successfully executing new and varied duties may have 
contributed to a feeling of accomplishment and self-fulfillment. The intricacies 
and complexities resulting from contractual stipulations may have provided the 
type of creative growth experiences necessary for self-actualization and 
self-enchancement. 


As Table 7 shows, principals’ preference for a particular structure for the 
representation of their own rights and interests was not significantly different 
whether the principal perceived a high or low level of teacher militancy. The fifth 
hypothesis was supported. Perusal of the data regarding the preferences of 
participating principals for representation of their rights and interests through 
specified structures revealed strong support for both the professional organization 
and management team structures. It appeared that principals were not strongly in 
favour of establishing unions, although professional associations representing 
building principals have adopted many union practices. 


TABLE 7 


CHI SQUARE SUMMARY DATA FOR THE RELATIONSHIP BETWEEN 
PRINCIPALS’ PREFERENCE FOR TYPE OF REPRESENTATIVE 
STRUCTURE AND PERCEIVED TEACHER MILITANCY 


Type of 


Representative Level of First Second Third Total for 
Structure Militancy Preference Preference Preference Other Each Group 
Management High 7 *GiS'.6%) 9 (20.8%) Bye Gib lee by 4) 0 (0.0%) va 
Team @ Low 12 (26.7%) 8 (17.8%) 4 ( 8.9%) 0 (0.0%) 24 
Union b High 2 ( 4.4%) iG PGES) TCS) EG tein) 4)) 21 
Low iD IC 225754) 8) (76) 17 (37.82) SWE (a7) 24 
Professional High 11 (24.4%) 9 (20.0%) a RGR 0 ( 0.0%) 21 
Association © Low 9 (20.0%) 13 (28.9%) 1 Qe IC 262%), 24 
Other 4 High h C2.00) 2 ( 4.4%) pen Fes 4 17 (37.8%) 21 
Low PAA Hsin) 0. (CO.10%) 2 Grae) 20 (44.4%) 24 
@ y2 = 1.29 
Dy? = 1.57 
@ x4 = 1.73 
Oh Cs Pe! 


The data in Table 8 reflects that the perceived level of teacher militancy was 
significantly different (p<.05) for principals in urban, rural, and suburban 
community types. Hypothesis 6 was therefore rejected. This finding demonstrates 
a correlation between the perceived level of teacher militancy and the community 
type. Moore’s (1974) research supported this finding and proposed that the level 
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of teacher union membership was related to the general level of organized labour 
strength in a region. Urban areas with far greater private sector union 
organization than suburban or rural comunities, also experience far greater public 
sector union organization. 


TABLE 8 


SUMMARY DATA AND THE ANALYSIS OF VARIANCE FOR THE 
RELATIONSHIP BETWEEN PERCEIVED TEACHER 
MILITANCY AND COMMUNITY TYPE 


Group n M SD 
Urban nS 135.47 Bede 
Suburban is: 124.20 Ia 
Rural We) 110.87 PUB GSI) 

Degrees Swn 
OUP of Mean F 

Source Freedom Squares Squares Ratto 
Between Groups 2 4549.35 2274.68 4.52% 
Within Groups 42 ZAMS B82: 503.19 
Total 44 25683.17 


* significant p<.05 


Situational Data 


Data demonstrated that teachers in 37 (82.2%) of the school buildings were 
represented by a regional affiliate of the National Education Association, 6 
(13.3%) were represented by the American Federation of Teachers, and 2 (4.4%) 
cited other representation without additional clarification. 


Forty-seven percent of the building principals were represented by professional 
associations. An additional 24 percent claimed representation through a 
management team. Eighteen percent of the building principals indicated that they 
were represented by unions. 


The building principals had a range of experiences with regard to teacher 
militancy: 80% of the principals had experienced an informal grievance, 69% had 
dealt with formal grievances, 35% had experienced some type of teacher job 
action. Of the ten principals that had experienced strikes, 80% of them were from 
urban districts. 


It is interesting to note that building principals were involved in the teacher 
negotiations process. Viewpoints of 69% percent of the building principals were 
represented indirectly in the negotiations process by their superintendents. The 
remaining 31 perent of the respondents had direct participation in teacher 
negotiations through review of the proposed contract prior to ratification or 
through actual representation at the negotiating sessions by a principal from the 
district. This participation in the negotiations process, far from lessening the 
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principals’ managerial prerogatives, may have indeed enhanced them and 
significantly influenced the outcome of this study. 


Summary 


In summary, building principals appeared to be fulfilling their needs in 
districts where a high level of teacher militancy was perceived and in districts 
where a low level of teacher militancy was perceived. In essence, the aggressive 
drive by teachers for their collective good did not significantly affect the job 
satisfaction of the building principal. These results might have been influenced by 
clarification of roles provided by comprehensive master contracts, by the success 
principals have enjoyed in coping with new administrative challenges, and because 
teacher organizations may have been more concerned with lower order needs such 
as salaries and fringe benefits than with higher order needs (autonomy, esteem, 
self-actualization). Perhaps in the fulfillment of the higher order needs, teachers 
would more directly challenge the administrative prerogatives of the school 
principal. The experience of the principals (as well as their direct and indirect 
involvement in the negotiations process) might also have influenced the study’s 
findings. 

Our findings suggested that teacher militancy may have had little impact on 
the job satisfaction of principals because the teacher organizations might have 
been more concerned with salaries and fringe benefits than issues directly 
influencing school policy and decision-making. Teacher unions in their nascency 
and embryonic stages are initially concerned with grievance procedures, 
recognition, salary, fringe benefits, organizational rights, impasse resolutions, 
extra-pay duty, and specific working conditions. But the practical effect of 
collective bargaining has been to grant teachers a greater measure of control over 
decisions of management. In those school districts which have been negotiating for 
twelve or more years, the teacher unions have been highly successful in expanding 
the scope of bargaining to include educational policy issues heretofore settled by 
the unilateral preserve of school administrators (Yeakey & Johnston, 1979). Thus, 
we suggest that the age or longevity of the teacher organization might have an 
impact on the job satisfaction of school principals. Again, the older and more 
mature teacher organizations have greatly expanded their settlements to the 
school policy arena, which would directly impinge upon the principals’ managerial 
latitude. Because our study did not control for the age or longevity of the teacher 
unions, a comparative study which would take into account that variable might 
yield some interesting results. Therefore, should this study be replicated, we would 
recommend that a contrastive study be conducted among principals in districts 
which have ‘been involved in collective negotiations for at least fifteen years and 
among school principals in districts which have been involved in teacher 
negotiations for no more than four years. 

Moreover, we note that teacher organizations which have been successful in 
expanding the scope of negotiations are not only older but more powerful as well, 
irrespective of state laws which might limit the scope of bargaining. For example, 
New York City, in addition to the legal jurisdiction of the state collective 
bargaining statute, the Taylor Law, has a city law that restricts the scope of 
negotiations, the New York City Administrative Code 1173-4.36. Yet, New York 
City’s United Federation of Teachers, which comes under this law, is considered to 
represent the epitome of teacher union power and strength in the policy and 
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decision-making arena (Cheng, 1976). Research could attempt to assess those 
variables which make certain teacher organizations relatively more powerful than 
others. A companion study could then analyze the job satisfaction of principals 
who are employed in schools and school districts that have varying degrees of 
teacher union power and strength. 


Finally, a new development is being evidenced in teacher negotiations today 
that relects the financial stringencies of most school districts. The escalating costs 
of basic school functioning and the poor financial condition of most school 
districts may preclude higher wages, salaries, and other lucrative monetary gains 
for teachers. As the economic picture becomes more bleak, we foresee the time 
when school districts, having little monetary reserve, will begin to bargain away, 
out of necessity, issues directly impinging upon managerial rights. The ensuing 
effects of this development on the job satisfaction of principals, in particular, and 
overall school management, in general, might generate some highly useful research 
findings with dire implications for all involved in the schooling process. 


This paper was presented at the 1980 annual conference of the American Educational Research Association in Boston, 
Massachusetts in April, 1980. 
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Children’s Difficulties with Base Ten Numeration 
Simulated for Prospective Elementary Teachers 


In preservice course work dealing with mathematics methods, it is often difficult 
to motivate students to think seriously about and to be appreciative of the 
subtlety of beginning number work in the early grades. The present study, by 
using a specially designed nondecimal numeration system, attempted to 
simulate difficulties that grade school pupils may have in learning base ten 
numeration. Using a pretest-posttest design with experimental and control 
groups, significant effects in favour of the simulation were found. Interview 
data were collected and reported in an attempt to explore further the impact of 
the simulation as revealed in the significant pre-post differences. 


Teaching students something they know little about is a difficult task; 
teaching them something about which they know everything is an even more 
difficult task. Whatever truth resides in the above statement forms the motivation 
for the present study. 


In teaching mathematics methods courses aimed at the general student in an 
elementary school teacher preparation program, the authors repeatedly 
encountered resistance, unconscious or otherwise, from students who were asked to 
devote their minds to activities concerned with basic concepts of base ten 
numeration. The students are so thoroughly familiar with the system that 


Thomas P. Atkinson and Daiyo Sawada are professors in the Department of Elementary Education at the University of 
Alberta, with responsibilities in the provision of the mathematics education component of preservice teacher preparation. A 
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search beyond the obvious in understanding the basic processes of mathematics. 
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everything is automatic, so automatic that no thought needs to be expended. It is 
thus very difficult for such students to become aware of any problems grade 
school pupils may have in understanding numerals. Even working with elementary 
school pupils having numeration difficulties is insufficient to bring home the basic 
complexity of thought involved in the numeration system, for the difficulty is 
frequently attributed to the child, not to the complexity of the system. The basic 
problem is neatly summed up by Gattegno’s phrase, “the obvious escapes.” This 
phrase encapsulates the idea that students think they know everything there is to 
know about the counting numbers, the names we say, and the symbols we write, 
when in fact a little probing soon reveals that they have little or no insight into 
the system. The obvious has escaped. For the most part, students use the system 
by rote, have learned it by rote, and unfortunately are likely to teach it by rote in 
the same manner unless they come to realize that the obvious has eluded them. 


Among several approaches employed by the authors to help students recapture 
the obvious is the use of a specially constructed numeration system. A nondecimal 
numeration system with completely new numerals and number names is outlined 
for students. They are then requested to develop the system further so that in all 
fundamental ways it behaves like the base ten system they know. One of their 
initial tasks is to write down the first few counting numbers. The first double digit 
numeral provides untold difficulty for more students than one would expect. Such 
difficulty undergirds the basic assumption of the simulation used in the present 
study: when students are let to struggle (and struggle is the correct word) with the 
continued construction of a new numeration system, the difficulties they 
encounter sensitize them to be more alert to and appreciative of the problems 
children have experienced with base ten numeration. Such difficulty leads them to 
recapture the obvious. 


The above strategy for recapturing the obvious has no doubt been tried by 
many mathematics educators in preservice courses. To our knowledge, however, it 
has been formally assessed and reported upon on only two occasions. Hamilton 
(1961) developed and used a base six system with completely new symbols and 
names. Using a pretest-posttest design involving experimental and control groups, 
he obtained significant differences in favour of the base six materials over the 
base ten materials on a 47-item numeration test covering regular base ten 
arithmetic. A four-year follow-up questionnaire revealed that the prospective 
teachers (now regular classroom teachers) still considered the original base six 
experience as very worthwhile and recommended its continued usage in the 
preservice program. Six years later, Haukebo (1967) used three different bases 
(octagonal, binary, and duodecimal) with prospective elementary teachers over a 
three-week period. In contrast to Hamilton, he did not obtain results supportive of 
the use of nondecimal numeration as a means of enhancing understanding of base 
ten numeration. Given that Hamilton did not achieve significant results until his 
second experiment, there seems to be need for caution in making blanket 
recommendations concerning the use of nondecimal numeration activities in 
mathematics methods courses. Accordingly, it seemed appropriate and timely to 
examine again the role of other bases in work on numeration for prospective 
teachers. 
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TABLE 1 


A COMPARISON OF THE HAMILTON AND SAWADA/ATKINSON 
NONDECIMAL NUMERATION SYSTEMS 


eer 


Hindu-arabic Hamilton Sawada/Atkinson 
numeral numeral name numeral name 
0 =) um © zip 
1 ae la ! alt 
2 ia) le [Nive bx 
3 NE ipl 7x cor 
4 ao) lo oO dat 
5 wr lu iS) (alt) ennl 
6 Ale lum ligt (alt) enn alt 
7 CRs hs 1 A (ait) enn bik 
8 ae () le lum } 4a (alt) enn cor 
9 wtf (Ne 1i lum [}, a (alt) enn dat 
10 akg ockon WO alec 
all ae an lu lum iN bik enns alt 
12 MO © - tesy ANA vik enns bik 
13 @ sa lesy-la ‘A A bik enns cor 
14 (\ iA lesy-le A 0 bik enns dat 
15 () AG lesy-li Ao cor enns 
16 Cpe lesy-lo je I cor enns alt 
17 A SE lesy-lu QO A cor enns bik 
18 oe eS lisy OD cor enns cor 
19 Ab. v8 lisy-la aa cor enns dat 
20 if (Or lisy-le Q© dat enns 
21 \ F lisy-if Ol dat enns alt 
22 a ale) lisy-lo Oo A dat enns bik 
23 an apy lisy-lu MA dat enns cor 
24 Le losy O00 dat enns dat 
25 Ab af losy-la 10 (o) (alt) fiv 
26 ac ia losy-le 12Ovl (alt) fiv alt 
27, Ry Yi Oeieey-1t | O% (alt) fiv bik 
28 ty 1s losy-lo (/MOVFL (Ale) fivecor 
29 AG yi losy-1lu / OC] (alt) fiv dat 
30 Je lusy iGo (alt) fiv (alt) enn 
31 oD Ce lusy-la hth well (alt) fiv (alt) enn alt 
32 ‘oF t" lusy-le i He OH (alt) fiv (alt) enn bik 
33 ae WL lusy-li 1 ( & (alt) fiv (alt) enn cor 
34 JU lusy-1o 1 { @ (alt) fiv (alt) enn dat 
35 J J lusy-1u { AO (alt) fiv bik enns 
36 SC 6 1aum 1 A] (alt) fiv bik enns alt 


al 
entries in parentheses can be deleted for the sake of efficiency. 
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Method 


In a broad sense, the present study was a replication of the Hamilton (1961) 
study with some key changes in the experimental treatment. As in Hamilton’s 
study, a completely new place value system was developed with new number 
symbols and names; unlike Hamilton, the base chosen was five rather than six (it 
was decided that if the base was to be rather small, it should be prime). Table 1 
lists the names and numerals used by Hamilton and the ones used in the present 
study. 


The following points of comparison and contrast should be noted in Table 1. 

1. Both systems have a base for grouping and use powers of the base to define 

place value for the digits within a numeral. However, Hamilton has 
incorporated the idiosyncrasies of the English language version of the names 
(see especially his “lum” numbers £ * through S J. corresponding to the 
“teen” numbers); the Sawada/Atkinson version eliminates all idiosyncrasies 
(historical accidents in the evolution of the development of the English 
language version), instead placing priority on the logical determination of 
names insofar as it is possible (see for example the names for the numbers 
/?, through /@ ). By emphasizing the logical determination of all 
aspects of the system, the arbitrary conventions inherent in the English 
language version are seen for what they are — arbitrary conventions — 
something which cannot be seen if the system is approached by rote. 

2. Both systems use mnemonic devices to decrease the load on memory in 
remembering and ordering the names for the single digit numerals: the 
vowels in Hamilton, alphabetic sequence in Sawada/Atkinson. 

3. The systems differ in the structure of the digits. Hamilton’s are essentially 
meaningless squiggles which are so much alike (four of the six are actually 
congruent) that learning them is made more difficult than should be. The 
Sawada/Atkinson digits possess strong tally meaning, thus facilitating the 
learning and the recall of the digits. 

4. There are fewer basic facts in base five than in base six (44 fewer if all four 
operations are considered); yet there is a sufficient number to underscore the 
importance of mastering the basic facts (via games, puzzles, etc.) 
before algorithms can be effectively learned. 


Design 

The design consisted of a pretest-posttest arrangement used on experimental 
and control groups formed with random assignment of students. During the fall of 
1978, 106 students enrolled in a semester course in elementary school mathematics 
methods. Of these, 30 were randomly chosed for the experimental treatment while 
the remainder were randomly partitioned into four groups of 19. Thus, for 
purposes of instruction, the four control groups were at an advantage if smaller 
class size makes any difference. Later on, for the purpose of data analysis, data 
from 11 randomly chosen students from three of the control groups were 
combined with the data from the remaining control group to provide a data set 
based on 30 control subjects to compare with the data from the 30 experimental 
subjects. Classes met three times a week for 50-minute periods. On Mondays, all 
106 students attended a common lecture. On Wednesdays and Fridays, students 
broke up into the five smaller groups for laboratory activities and discussion. It 
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was during these small group sessions that the experimental group received the 
Sawada/Atkinson Numeration System materials while the four control groups 
worked with regular materials based largely on Kennedy’s (1975) methods text in 
which there is a fair amount of nondecimal work which was neither stressed nor 
ignored in the control groups. The manipulative aids used in the experimental and 
control groups were the same except that base five blocks were used in the 
experimental group while only base ten blocks were available in the control 
situation. Since the control groups were somewhat smaller than the experimental 
group, control students may have had more opportunity to engage in experiences 
with manipulative materials. The basic methodology employed for all lab work for 
all groups could be described as guided discovery. The experiment was conducted 
over a period of four weeks during which the three mathematics topics under 
study were numeration, basic facts, and algorithms. The amount of time devoted 
to any one of the three topics was the same for all groups. However, with the 
experimental group, the instructor used both systems of numeration to illustrate 
basic facts and algorithms, whereas with the control groups, base ten numeration 
was used almost exclusively. The control groups were alerted to the existence of 
nondecimal systems using conventional numerals whereas the experimental group 
learned base five numeration through the medium of the Sawada/Atkinson system. 
One of the authors supervised the experimental group while the other, with the 
assistance of the third staff member, supervised the control groups. 


TABLE 2 


THE PRETEST AND POSTTEST QUESTIONS 


1. Write the numerals for the first fifteen counting numbers. Circle the 
numerals whose meaning depend on an understanding of a fundamental 
principle of numeration. State that fundamental principle. 


2. Do you think a grade one child of at least average intelligence would 
have difficulty in understanding the first of your circled numerals 
in question 1? State your reason(s). 


3. In reading the number 100 we say "one hundred" or "a hundred" or 
"hundred." In reading the number 10 we only say "ten." Explain why. 


4. What differences might exist in the thinking of a child (about grade 2) 
in computing 23 - 5 as compared to 13 - 5? 


5. One of the reasons for recommending the universal adoption of the metric 
system is the fact that it is based on ten. If our system of numeration 
had been base eight instead of base ten would the metric system be so 
attractive? State your reason(s). 

6. Carry out the following computations in the bases suggested: 

(a) iS xe eneabasemtd ver 


(b) Seer x e24ine baseusiux: 


Show your work. 


a SD 
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Instrumentation 


Two basic and mutually supportive types of data were collected: (1) paper and 
pencil test data assessing understanding of decimal and nondecimal numeration 
and the impact of numeration on basic facts and algorithms, and (2) interview 
data assessing the impact of the laboratory activities on student awareness of 
difficulties children may have. The paper and pencil test consisted of 6 questions 
which were given in pretest and posttest form, the same 6 questions appearing in 
both tests. These 6 questions (see Table 2) were designed to assess whether the 
Sawada/Atkinson System resulted in any transfer for the purpose of developing 
insight into the base ten system. 


Eight students each from both the experimental and the combined control 
groups were interviewed. These were selected so that four of the eight from each 
group were active participators in the lab activities while the other four were 
much less so. The basic questions in the interview were the same for both the 
control and experimental groups. 


Analysis 

The paper and pencil test data were handled in a somewhat novel way. The 
usual procedure would have been to use either an analysis of covariance with the 
pretest as a covariate, or to subtract the pretest from the posttest and conduct an 
analysis of variance on the difference scores. The weakness of difference scores as 
a metric has long been recognized. The weakness of the analysis of covariance in 
terms of external validity has only recently been recognized. To avoid both these 
weaknesses, it was decided to assess directly the difference between a subject’s 
response on the pretest and his response on the posttest for each question. These 
directly assessed differences were then summed over the 6 questions to give rise to 
a criterion measure. The direct assessment was done blindly, so that the assessor 
did not know whether the data were from an experimental or a control subject. In 
order to acquire a measure of reliability of the assessment technique, both 
investigators undertook the assessment independently. A correlation coefficient 
between their scores was calculated and a value of 0.87 obtained. This value is 
higher than reliabilities usually obtained on difference scores. To increase the 
stability of the assessment scores even further, the scores given by each 
investigator were averaged for each student and these mean ratings were used as 
the criterion variable. 


Results 


Paper and Pencil Data 

A simple ¢ test was performed on the mean scores of the experimental and 
control groups. As shown in Table 3, the obtained ¢ was significant at the 0.01 
level indicating that the treatment had significant effects in the desired direction. 
An examination of the scores for each individual question showed that the 
experimental group demonstrated greater gain on all six questions, but especially 
on question 1, a result which is commented upon further in the discussion. 


Interview Data 


The major purpose for gathering the interview data was to ascertain in what 
specific ways the students learned from the experiences provided in the 
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experimental as compared to the control situation. In particular, it was anticipated 
that the experimental group, in working (struggling) with the special numeration 
system, would become much more aware of the thinking required in creating a 
new numeration system thus jarring them out of their too comfortable 
complacency. Question 1 deals with this. As shown in Table 4, the experimental 
group gave many more (11 vs. 4) specific responses to the question relating to the 
difficulties encountered by the student in working with the numeration activities. 


TABLE 3 


TEST OF TREATMENT DIFFERENCES 


Source Mean df i Probability 
Experimental group Za 
1/58 2.64 0.01 
Control group don0) 
TABLE 4 


INTERVIEW DATA: FREQUENCY OF SPECIFIC RESPONSES 
GIVEN TO EACH QUESTION 


Question Frequency 


Experimental Group Control Group 


1. What difficulties did you encounter 
in doing the activities? tal 4 


2. Did the activities alert you to any 
difficulties children may have in 
learning base ten numeration? 2) 7 


3. In what ways did the manipulative 
materials help you in understanding 
numeration? 6 14 


However, experiencing difficulty is not of particular value unless it makes the 
student aware of possible difficulties that children might have in understanding 
base ten. Thus, question 2 asks whether they became aware of difficulties children 
might encounter in learning base ten numeration. Again, Table 4 indicates that 
the experimental subjects were able to give many more specific suggestions (25 vs. 
7). This rather large difference is taken as strong support for the effectiveness of 
the simulation. 
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Question 3 was asked for the purposes of control. In studies of the present 
kind in which different instructors are used for the experimental and control 
treatments, instructor differences in terms of effectiveness become viable alternate 
hypotheses for group differences. In order to get a reading on the possible 
operation of this confounding source of variance, students were asked question 3, 
dealing with a characteristic common to both treatments. The data indicate that 
the control group was able to generate more specific responses (14 vs. 6) than the 
experimental group when asked in what ways the manipulative aids helped in 
their learning. The superior performance by the control subjects suggests that the 
instruction given in the control situation could by no means be characterized as 
generally inferior to that given in the experimental situation, thus annulling, to 
some extent, the alternate hypothesis stated earlier. In retrospect, the superior 
performance of control subjects on question 3 was probably due, at least in part, 
to two facts: (1) the control groups had access to a wider variety of manipulative 
aids, (2) the control groups were considerably smaller than the experimental 
group, making experiences with manipulative materials easier to obtain. 


Discussion 


The evidence gathered in the present study shows rather conclusively that 
work with a nondecimal numeration system involving entirely new number 
symbols and names can lead to heightened awareness and appreciation of 
difficulties that children might have in understanding base ten numeration. 
Furthermore, not only did the experimental subjects show more insight into base 
ten (question 1), but they also learned to operate better with nondecimal bases 
using traditional notation than did control subjects (question 6). The transfer of 
insight to base ten was the underlying assumption of the simulation. That this 
assumption is valid is indicated by the results. It thus appears safe to infer that 
the use of a nondecimal numeration system such as the Sawada/Atkinson system 
has value when used in preservice teacher education. Such a conclusion is in 
concert with Hamilton’s (1961) findings but at variance with those of Haukebo 
(1967). It is suggested that Haukebo’s counter results were due, at least in part, to 
introducing too many different bases (octagonal, binary, and duodecimal) over a 
short period of time (3 weeks). It is likely that students mastered none of the 
nondecimal systems resulting in little or no transfer of insight to base ten. In the 
present study, as in Hamilton’s, a three-week period or longer was devoted to one 
nondecimal system with greater benefit. 

With the advance of metrication and the current availability of hand-held 
calculators, a timely suggestion for further research would be to assess the efficacy 
of the study of a nondecimal numeration system in producing understanding of 
decimal fraction numerals for prospective teachers. 


This research was supported by a grant from the Alberta Advisory Committee on Educational Studies. The assistance of Eileen 
Perfrement in the conduct of this research is gratefully acknowledged. 
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Participatory Supervision in Education: 
A Must for the Eighties 


Recent trends in educational literature indicate that school administration is 
becoming more participatory. In particular, attempts at instructional 
improvement are relying more on human resources styles of supervision. It is 
suggested in this paper that a management by objectives model can provide an 
effective procedure for staff supervision. From the results of a study utilizing a 
modified management by objectives model which are presented, there is an 
indication that human resource development can be facilitated by utilizing such 
a model. Participants had a high commitment to the organization and high 
motivation to succeed. A modified management by objectives model appeared to 
hold great potential for implementing a participatory approach to management. 


Research by Halpin (1955) and Blake and Mouton (1964) suggested that 
successful leaders employed a humanistic style of management. Recent research by 
Likert (1976) in his discussion of collaborative management support these 
findings. It is becoming more apparent that material resources cannot enlarge, 
that the additional output necessary for organizational success can be realized only 
by a fuller exploitation of existing human resources. 

To achieve success in an educational institution, an organization must build a 
team, a true team in which individual efforts are welded into a common effort, 
where each member contributes something toward a common goal. Each position 
and each individual effort must be directed toward the objectives of the total 
enterprise. The value of each individual, subordinate and superordinate alike, 
must be measured in terms of the contribution he/she makes to the success of the 
total system. The manager must provide the leadership for that team. 


Today we are being challenged by professional educators to examine new 
theoretical conceptualizations and to exercise more dynamic leadership in the 
instructional improvement process. There has been renewed interest in the theory 
and practice of instructional supervision during the past five years. During this 
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time more than two dozen professional texts on instructional supervision have 
been published in North America. In addition dozens of articles have appeared in 
professional journals. There have also been many attempts to clarify the confusion 
associated with supervision and supervisors. Some discernible trends have 
appeared in the recent literature. 


The concept of viewing supervision as a team approach, involving all personnel 
above the teaching ranks, has received some attention in recent years. Wiles and 
Lovell (1975) discuss this trend when they state that “instructional supervisory 
behaviour is assumed to be an additional administrative behaviour system 
formally provided by the organization for the purpose of interacting with the 
teacher behaviour system” (p. 6). Similarly, Alfonso, Firth and Neville (1975) refer 
to the “professional instructional team” (p. 36) as a pattern for the combining of 
diversified talents and efforts as a means of having a direct impact upon the 
school or unit with whom the team is working. 


Another trend, suggested by Sergiovanni (1975) is to view the instructional 
improvement process as human resources supervision. This conceptualization is 
based on the assumption that identity, commitment, and motivation are essential 
ingredients in any practice in supervision. In this perspective it is assumed that 
teachers are professionals, that they want to improve themselves professionally, 
and that “teachers have a desire for and a need for full participation as 
shareholders in the affairs of the school and that they are prepared to work very 
hard on behalf of the school as they have the opportunity” (p. 12). 


A further development is considerable agreement today that supervision is not 
the province of a particular person but is a natural part of the school organization. 
It must be a team approach involving all members of the instructional staff. The 
purpose of supervision was and still is to improve instruction. It is a process 
designed to facilitate pupil learning and to achieve the goals of the organization. 
The goals have not changed, the means for achieving the goals have changed 
because of new insights into how organizations function. This perspective was 
advanced by Harris (1976) when he stated, “The archaic notion of the lone 
supervisor, without status, disguised as just another teacher, will not suffice. This 
is an organized society in which lone individuals are doomed to be ineffectual” (p. 
334). 


What models are available for supervisors to adopt in their attempts to 
maximize the human potential of educational institutions? This paper will 
describe a management model which can provide a humanistic approach to 
supervision and presents findings from research that sought to establish whether 
the use of that modified management by objectives approach facilitates human 
growth and organizational success. 


Management by objectives. 


Management by objectives is a systematic means of isolating specific tasks to 
be accomplished by managers over an agreed time period. The basic philosophy 
has been described by Odiorne (1965) as 

a process where the superior and subordinate managers of an organization jointly 
identify its common goals, define each individual’s major areas of responsibility in 
terms of the results expected of him, and use these as guides for operating the unit 
and assessing the contribution of each of its members. (p. 26) 
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Peter Drucker (1954), the originator of MBO, described the proactive nature of 

the concept as follows: 
it is management’s specific job to make what is desirable first possible and then actual. 
Management is not just a creature of the economy, it is a creator as well. And only to 
the extent to which it masters the economic circumstances and alters them by 
conscious, directed action, does it really manage. To manage a business means 
therefore to manage by objectives. (p. 12) 

In operational terms MBO refers to a process whereby superiors and 
subordinates meet and discuss goals (results) for the subordinates which are in 
line with organizational goals. Following this discussion the superior and 
subordinate jointly establish attainable goals for the subordinate. Once the series 
of objectives is specified and the resources are agreed upon, programs to meet the 
objectives are placed into operation and measurement of achievement is set for a 
predetermined date. Strategies for the improvement of performance are frequently 
stated. Several questions need to be answered when participants write 
performance objectives, e.g., What measurable change is to occur? What amount 
of time is allowed to accomplish the desired change? What is the minimum level 
of achievement allowable? At what point in time is the performance output to be 
measured and evaluated? Following the operation of the program, the superiors 
and subordinates meet to evaluate the subordinate’s performance in terms of the 
goals. 

A prime advantage of this approach is that subordinates know where they 
stand with regard to the contributions they are expected to make to the 
organizational unit and the system. It is expected that the process will result in 
improved employee and organizational performance, improved morale and 
attitudes of the participants, and reduced anxiety from job ambiguity. Since MBO 
involves an analysis of an employee’s efforts in terms of clearly defined objectives 
and results, it is a more positive appraisal method. The subordinate becomes an 
active agent, rather than a passive object to be appraised. The accent is on 
performance which is directly related to organizational goals. 


Research on MBO in Industry 
Studies assessing the success of MBO in business organizations have 
been conducted for many years. 

In a study conducted in 1965, Raia (Raia, 1974) cited increased awareness of 
organizational goals, improved communication and understanding of organizational 
purposes as positive outcomes of MBO. A follow-up study, however, showed that 
many of the participants felt that performance levels had not improved because of 
MBO. 

Tosi and Carroll (1968) interviewed 48 managers involved in a work planning 
and review program. Generally positive reactions were noted relative to more 
objective performance evaluation, role clarification. In addition higher levels of 
performance and motivation were noted. They also noted that the subordinate’s 
initial enthusiasm turned to disenchantment when the model was violated insofar 
as feedback relative to objectives was not given and superiors provided only 
surface support for MBO. 

Ivancevich (1974) conducted an empirically-based longitudinal study of 
performance in a manufacturing company utilizing an MBO model. He assessed 
the performance of 181 supervisors in the production and marketing departments. 
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He noted that the success of MBO was enhanced by the use of memos, letters, 
and meetings. Improved performance by a majority of the supervisors was noted. 


A comprehensive study by Stein (1975) surveyed 428 lower and middle 
management personnel in organizations where MBO had been in practice from 
two to five years. The study involved ten different organizations and sought 
answers to three research questions. The first question asked if MBO was 
inherently faulty. Stein found that it was not if the process was properly 
implemented and managed. Secondly he asked if substandard training and 
implementation procedures are the primary causes of failure. Results suggested 
that these factors were indeed important contributing factors. Thirdly, Stein 
investigated if the primary cause of failure was inadequate management attention. 
The study strongly supports this contention. Lack of appropriate review policies, 
unrealistic planning, and poor delegation of objectives were indicated as symptoms 
of ineffective management. A number of problems associated with MBO are 
outlined by Stein. These include (1) difficulty in writing meaningful objectives, 
(2) lack of follow-up and updating procedures, and (3) lack of commitment by 
upper management. Stein concludes that despite the criticisms stated by the 
participants, over 80 percent still favoured the retention of MBO. 


In summary, studies in industry suggest that MBO is a viable management 
system for that sector. If it is to succeed, the manager must possess a high level of 
management skill, especially in the areas of goal setting, communication, and 
interpersonal relations. 


MBO in Education 


For the purpose of this paper MBO in education is defined as a disciplined 
communication process which relies heavily on an interview process, whereby all 
persons in an organization cooperatively express their hopes and aspirations for a 
stated period and rationalize them within the context of the overall goals of the 
corporation as stated by school officials. 


Research on MBO in Education 


From a review of the literature, it can be seen that MBO may be used 
successfully in education. 


Fleming (1978) suggests that no one appears to disagree with the need to 
define objectives or to measure the results and modify programs accordingly. 
There exists an overall realization that educational effectiveness must be 
improved; there is a need for the schools to provide reasonable standards of 


achievement for students, particularly as it relates to literacy and basic computing 
skills. 


Boston and Grove (1978) describe a results-oriented management system 
developed by the Indian Hill School District near Cincinnati, Ohio. The program 
involved most of the concepts which have been developed in this paper. In 1974 a 
qualified citizens’ committee, appointed by the board of education, identified the 
following problems: (1) lack of long range goals, policies, and guidelines, (2) lack of 
budgetary procedures for monitoring and forecasting expenditures, (3) lack of 
definitive job descriptions that show individual responsibilities of staff members, 
(4) lack of a systematic and rational basis for the evaluation of programs, 
facilities, and staff, (5) lack of staff involvement in those decisions directly 
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affecting their responsibilities, and (6) lack of an organized program of staff 
development. A modified management by objectives approach was adopted. A 
program for the development of district goals was initiated. Similarly, a plan for 
the development of steps necessary for the attainment of the goals was 
established. A course of action for the attainment of each individual objective was 
developed which included initiation and completion dates, resources, activities, 
and tasks for each objective. It was understood that if the required condition for 
the completion of an objective changed, the objective would be modified 
accordingly. Provision was made for achievement to be reviewed on a regular 
basis. By 1976, it was felt that the problems identified earlier by the citizens’ 
committee had been dealt with successfully. 


In another article (Herman, 1978), the MBO system used in the West 
Bloomfield School District of Orchard Lake, Michigan is described. The major 
strengths and weaknesses of the process are discussed. The author concludes by 
stating 

MBO is a useful tool if you use it wisely and establish your local districts modified 
system. West Bloomfield’s students have benefitted by its use, and the management of 
many aspects of the district’s administrative operation have improved from its use. (p. 


10. 


83) 


What It Is 


A clear statement of the major 
features of the job. 


A cooperative effort where the 
participants discuss goals 
to be achieved. 


A commitment through participation 
where each individual is able to 
rationalize his own career expecta- 
tions with goals of the organization. 


A process where the goal setter 
is an active agent. 


A commitment for future action 
that will improve performance. 


Measure of current performance. 
A positive approach to job analysis. 


A potent way to motivate people through 
superior-subordinate interaction based 
on the belief that each individual knows 
his own capabilities better than 

anyone else. 


A process that leads to self-evaluation. 


A process that considers the development 
of human resources a primary means of 
achieving the objectives of the 
organization. 


Figure 1. Pros and cons of MBO in education. 


What It Is Not 


Based on out-of-date job 
descriptions or misconceptions. 


An annual or semi-annual 
special ceremony where 
everyone is ill at ease. 


Formal discussions where the 
individual is told by the 
superior the goals he is to 
reach. 


A process where the goal setter 
is a passive object. 


A review of past failures with 
directed solutions from the superior. 


Measurement of potential. 

An accounting that stresses appraisal. 
A superior sitting in judgement on a 
subordinate, autocratic management 
based on the belief that superiors 

can judge performance of subordinates. 
A process that motivates through fear. 
A process that stresses organizational 


patterns as the primary means of 
ensuring productivity. 
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Moberly and Stiles (1978) describe an MBO system which was used 
successfully in Seattle to assist the school board in addressing its primary tasks: 
long range planning, setting priorities, and evaluating the superintendent. The 
procedure used was as follows: (1) the superintendent delivered a report and a 
self-evaluation based on the goals stated by the board at the time of his 
appointment; (2) the board evaluated the superintendent; and (3) long range goals 
were reassessed and priorities set for the next year by the board and the 
superintendent. 

In their assessment of the MBO system used in the schools of Hyde Park, New 
York, Gray and Burns (1979) conclude that where MBO has not lived up to 
expectations in education, some of the basic concepts, such as span of control, 
have been violated. These authors state that the most successful applications of 
the process have been with persons facing dismissal; in such cases very specific 
objectives were drawn up and followed and since the process was a matter of 
survival it was taken seriously. In general, the authors conclude that MBO holds 
great potential as a management tool that can bring benefits to students. 


Figure 1 further exemplifies what MBO is and what MBO is not when applied 
to education. 


A Research Study 


As part of a movement to meet the need for more effective administration, the 
Carleton Board of Education in Ontario implemented a_ results-oriented 
management (ROS) system in 1974. For the purpose of this paper ROS will be 
defined as a cyclic process of growth modification which uses bilateral objective 
setting activity with prime emphasis on formative assessment. As indicated in this 
definition the process is designed to improve, not to prove. It assumes that the 
participants are professionals who are inherently motivated to improve their 
performance. This specific orientation to professional growth is a key to the 
process. By emphasizing the formative evaluation component and de-emphasizing 
the summative evaluation aspect, a climate of mutual trust and cooperation can be 
developed. The basic purpose of ROS is to improve the teaching and learning 
process in the schools. This is accomplished through the development of human 
resources in the system. The definition stresses that objectives are bilateral in 
nature for organizational objectives are dependent upon many others for 
completion and. a sharing of objectives and constant review of accomplishments 
are necessary for progress. 


Since its inception ROS has undergone several modifications to adapt to 
changing conditions, including evolution from a strictly individual goal-setting 
process to a procedure in which individuals set goals within the context of the 
total organization. The most significant aspect of the Carleton Board of Education 
project has been the application of the process to the classroom, in the form of 
results-oriented supervision. Results-oriented supervision is an effort to humanize 
the supervisory process and is viewed as a continuous process and not simply an 
inspection reporting procedure. The ROS process is in continuous operation. 
Gradual planned progress toward predetermined objectives is frequently assessed. 
The role of the supervisor becomes that of assisting the teacher in facilitating the 
achievement of the goals. Individual teacher objectives are shared by the 
supervisor who develops a commitment to the program. The cyclic nature of the 
process is stressed. New objectives are set and reviewed on a regular basis. The 
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process of formulating objectives, establishing procedures for meeting the 
objectives and evaluating the program becomes a major task of the school team. 


The actual method involves three phases. Objectives are reviewed and set at a 
preconference, the process is implemented and monitored, and a postconference 
takes place to assess the results and restate objectives. It is hypothesized that the 
process will produce a high quality of instruction by a highly motivated staff and 
that the major benefit will be to students. It is further assumed that growth will 
pervade the entire school and school system. 


In an effort to find out if the ROS process was indeed producing the intended 
results, a study of the supervisory program was commissioned in 1979. The 
purposes of the study were as follows: (1) to obtain information from teaching 
staff on progress in ROS, (2) to re-establish current needs and goals concerning 


ROS. 


Methodology 


The committee decided that data should be gathered from all participants in 
the ROS process; this included teachers, department heads, vice-principals, and 
principals. Data were to be collected by questionnaire and selected interviews. 
Since no questionnaire existed which would adequately measure the status of 
ROS, it was necessary to construct suitable instruments. Questions dealt with the 
actual process, ideal procedures for supervision, attitudes toward ROS, and 
perceptions of supervisory leadership. Each questionnaire was piloted with a small 
sample of the teachers and modifications made. The instruments were distributed 
to the participants during the latter part of the 1978-79 school year. The size of 
the sample is given in Table 1. 


TABLE 1 


RESPONSES TO QUESTIONNAIRES 


SS nmnnnnnTnnTTTITEETTTTEEEEEEEEEEEEEEEEEEEEE 


Group Number Mailed Number Returned % Returned 
Elementary Teachers 358 255 Tales? 
Secondary Teachers 254 Tf} 68.1 
Department Heads 164 120 US yrae 
Vice-Principals 70 59 84.2 
Principals 70 63 90.0 

Total 916 670 Sh ll 


Interviews with a segment of the population were conducted by researchers 
from the University of Ottawa. Information was also obtained in the interviews 
concerning strengths and weaknesses of the process which did not lend themselves 
to specific questions. 


Results 
In regard to the question, “To what degree are Carleton teachers involved in 
an ROS process?”, the data indicate that 47 percent of the respondents were 


381 


I. I. Dow 


utilizing ROS. The finding is significant on two counts. First, though the literature 
indicated that very few teachers set objectives for the improvement of job 
performance, these data indicated that almost half of the teachers in the Carleton 
system were involved in such a process. Secondly, the respondents indicated that 
assistance in writing purposeful objectives which contained the criteria for success 
had been provided and that they were improving in their ability to write 
meaningful objectives. The fact that 80 percent of the teachers who were involved 
in the ROS process found their supervisor to be helpful in facilitating the 
attainment of goals suggests that teachers were reacting positively to the 
supervisory assistance given by administrators. 

In regard to the second research question, “Was the ROS process threatening 
to teachers?”, it was found that 90 percent of the respondents who were utilizing 
the ROS process found the procedure to be nonthreatening. This is one of the 
most significant findings of the study since the literature suggests that most 
supervisory activities are threatening to teachers and that no other supervisory 
process received such acceptance. A major objective of the ROS process was being 
realized: a supervisory program had been implemented that was acceptable to 
professionals. 

The third research question asked if the ROS process had improved job 
performance. The data indicate that 95 percent of the teachers who had 
participated in ROS felt that the experience had improved their job performance. 
The finding was substantiated by the principal data. This finding is significant to 
school administrators since it suggests that the ROS process can be used 
successfully to improve instruction. 

In regard to the fourth research question, “What percentage of teachers 
reacted favourably to ROS?”, it was found that over 90 percent of the teachers 
were satisfied with the ROS process. This finding is significant since most 
supervisory procedures produce a high level of dissatisfaction. A highly motivated 
staff was evident in the response rate. This fact was substantiated by the 
interviews. 

Research question number five was designed to ascertain if the superiors 
involved in the ROS process were rated more favourably as persons and as 
supervisors than superiors not involved in ROS. Table 2 summarizes the findings. 


TABLE 2 


MEAN RATINGS OF SUPERIORS BY TEACHERS 


Group Involved Mean Rating of Superior 
As a person As a supervisor 
Not involved in ROS at all 2.85% Slo 
Utilizing an ROS process 2.89 335 


* : * * . . : 2 
Range of possible scores is 1-7. Low score indicates positive evaluation. 


In most cases the individual was rated higher as a person than as a supervisor. 
In situations where the teachers were involved in ROS, the supervisor was rated 
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quite highly as a supervisor. These findings are significant to educators. Data of 

this study suggest that the best way to become known as a supervisor is to 

supervise. Related literature indicates that administrators are reluctant to 

supervise since it may jeopardize their relationships with teachers. Our data would 

not support this contention. Teachers in this study expected their superiors to 

perform a supervisory role and respected those who fulfilled this responsibility. 
Department Heads. The specific research questions concerning department 

heads were 

1. To what degree are department heads involved in an ROS process? 

2. Were department heads being supervised by an ROS procedure? 


In regard to the first question it was found that 75 percent of the respondents 
had identified lists of objectives for the improvement of performance of the 
teachers in their department. A surprising finding was that in only 55 percent of 
the cases had the objectives been established jointly by the department head and 
the teacher. In the other cases they were either a group decision or had been 
produced by only one of the participants. These findings are significant on several 
counts. Initially, the ROS process stresses the necessity of establishing the 
objectives jointly. In almost half of the cases this was not happening with the 
department heads. They were not following the suggested ROS procedure. It 
appears that many department heads were supervising as they had done 
previously only now they were calling it ROS. It is not surprising that most 
department heads were setting objectives for their subject areas as this is part of 
their job description; however, many were not working with individual teachers in 
establishing the objectives. Understandably, it is difficult to determine if 
individual teacher progress was made in achieving objectives. 


In regard to the second research question, “Were department heads being 
supervised by an ROS procedure?” it was found that only 15 percent of 
department heads were being supervised in such a manner. Although almost 40 
percent set objectives for the improvement of job performance, many of them had 
determined the objectives without consultation. Therefore the ROS process at the 
department head level had not been fully developed. Literature suggests that 
administrators tend to supervise as they are supervised. If administrators are not 
involved in an objective-setting process with their superiors they may not utilize 
such a procedure with their subordinates. In the Carleton study several reasons 
were given for the low utilization of an ROS process. Lack of time, lack of specific 
subject expertise, and the low priority allotted to supervision were most frequently 
mentioned. | 

Vice-Principals. All of the schools in the Carleton Board of Educaton have 
vice-principals, but their responsibilities vary. Some vice-principals teach full-time 
while others have no designated teaching responsibilities. Since this position has 
been traditionally a training position, it was assumed that many of the incumbents 
would be performing a supervisory role. The researchers wanted to ascertain if 
vice-principals were being supervised by their superiors utilizing an ROS process 
and if vice-principals were supervising teachers via an ROS procedure. Fifty 
percent of the respondents were being supervised by their principals using an ROS 
approach. A similar number reported that they were supervising teachers either on 
their own or in conjunction with the principal and using an ROS procedure. 


Interview data supported the suggestion that vice-principals were attempting 
to implement the ROS process on a wide scale throughout the system. These 
findings are significant. If the ROS procedure is to gain wide acceptance in the 


383 


I. I. Dow 


Carleton schools it must have the support of all levels. Vice-principals will be the 
main supervisors in the schools of the future. If they are utilizing an objective 
setting procedure in their training program, they are more inclined to use it when 
they become principals. A number of vice-principals were conducting seminars in 
their schools to acquaint teachers with the ROS process. This procedure appears 
to have potential as a staff development method at the vice-principal level. 


Principals. During the introduction of ROS in Carleton, the principals were 
identified as pivotal to successful implementation. They had _ received 
approximately 40 hours of training in the procedure. Special training was given in 
the use of a results-oriented process as well as specialized workshops in 
objective-setting, interviewing, and areas in which objectives could be formulated. 
Had all of this planning resulted in implementation of ROS in the schools? Did 
principals perceive improvement of performance in teachers? What were the major 
strengths and weaknesses of ROS? 


Results indicate that approximately one-half of the total teaching staff were 
being supervised with an ROS procedure. Several of the principals were utilizing 
ROS with the total staff. In other schools, principals were using ROS with selected 
teachers. Many schools have implemented ROS during the current school year. 
These findings indicate that the process was successfully implemented in the 
system. Earlier studies had suggested that very little supervision was taking place 
in the schools. 

With regards to the second question, “Did principals perceive improved 
performance by teachers?”, 90 percent of administrators indicated that the process 
resulted in an improvement in job performance of the teachers. This finding was 
supported by the teacher data. As indicated earlier this finding is significant in 
that it is difficult to find an alternative supervisory procedure in today’s schools 
which receives such a vote of confidence. As well as being accepted by teachers, it 
results in perceived improvement of performance. 


What were the major strengths and weaknesses of the ROS process? All of the 
principals reported that the procedure encouraged and facilitated planning. Most 
reported that the process established better communication with staff and 
enhanced professional growth of teachers. Fifty percent of the principals stated 
that the process was providing measurable benefit to students. The weaknesses 
reported by principals involved procedural and administrative problems. Many 
found the objective setting process to be very time consuming. Others reported 
that teachers were having difficulty setting purposeful objectives and needed 
additional assistance. Several indicated that from the principal’s perspective it was 
difficult to identify student benefit. A few principals indicated that the process 
was weakened by a lack of commitment on the part of their superiors. 


The problems identified in the study do not suggest inherent weaknesses in 
the results-oriented process. They indicated the need for an ongoing inservice 
program to facilitate implementation. The question of time will always be a factor 
in any administrative response. Supervision is a top priority for school principals 
and time must be found for this function. It was noted that some of the principals 


who were using the ROS process with the total staff did not report time as a 
difficulty. 


Summary 
A survey of several large school systems in 1972 reported by Saxe stated that 
to all intents and purposes there was no supervision taking place in the 
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elementary schools. Data indicate that this is not the case in the Carleton Board 
Schools. The research questions investigated the degree of involvement, reaction 
of participants, and their perception of the effectiveness of a results-oriented 
supervisory process. Approximately 50 percent of the teaching staff in the 
Carleton Board of Education are being supervised using a_ results-oriented 
approach. The process is perceived as non-threatening by almost all participants. 
ROS is perceived by teachers to assist in improving job performance. A 
considerable number of department heads were utilizing an ROS procedure with 
the teachers in their subject areas. In several instances the procedures followed 
did not represent the original program implemented by the administration. For 
example, many of the department heads had established the objectives on a group 
basis. Therefore the process was of limited value in supervising individual teachers 
in the department. Only a small number of department heads were being 
supervised by their superiors via the ROS procedure. With regard to 
vice-principals, approximately 50 percent were involved with a results-oriented 
approach to supervision. Several were carrying out the process in conjunction with 
principals. In addition, almost half of the vice-principals were being supervised by 
their principals using an ROS approach. Principal data supported the results 
obtained from the other participants. The principals were utilizing ROS with 
approximately half of the teaching staff. Ninety percent of the principals 
indicated that the process resulted in improvement in job performance of the 
teachers. This group also perceived the process to be nonthreatening to the 
participants. All of the principals reported that ROS encouraged and facilitated 
planning. In addition most indicated that ROS enhanced communication with 
staff and encouraged professional growth of teachers. Fifty percent of the 
principals stated that the process provided visible benefit to students. Weaknesses 
of the ROS process appeared to centre upon time constraints and administrative 
problems. 


Conclusion 


Results of a study in the Carleton Board of Education suggest that a modified 
management by objectives process has considerable potential for education. A 
results-oriented approach to supervision has been successfully implemented in the 
system. The procedure facilitates the development of the human resources within 
an educational institution. Results-oriented supervision enhances the feelings of 
identity, commitment and motivation necessary for creating growth in a 
professional organization. Results-oriented supervision facilitates a team approach 
to administration in schools. Recent research would indicate that participatory 
management is no longer an option. Organizations which are adapting to the 
changing needs of society must develop the concepts inherent in participatory 
management. A modified management by objectives holds great potential for 
implementing a participatory approach to management. Such a_ procedure 
recognizes the expertise of a professional staff and facilitates growth in individuals 
and in the total organization. 
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RESEARCH NOTES 


ARE CANADIANS PENALIZED 
BY THE INFORMATION SUBTEST OF THE WAIS? 


The Information subtest of the Wechsler Adult Intelligence Scale (WAIS) 
contains 29 items, 8 of which have a distinct American content (items 1, 6, 9, 11, 
14, 17, 20, 21). In the experience of the present authors, Canadians who take this 
subtest regularly express dismay at the American content of the items. The 
purpose of the present study was to ascertain whether or not the American 
content of these eight items results in depressed subtest scores for Canadian 
test-takers. 

Tsushima and Bratton (1977) noted that there has been a paucity of research 
evidence on the possible effects of geographic location and sociocultural milieux 
upon WAIS scores. In their study, Hawaiian psychiatric outpatients were found to 
perform significantly worse than U.S. mainland psychiatric outpatients on the 
Vocabulary, Comprehension, Similarities, Information, and Digit Span subtests of 
the WAIS. It appears then, that differences in the sociocultural milieux of these 
patients have affected their WAIS scores. 

In the present study, 47 adult Canadians (all held Canadian citizenship) were 
randomly assigned to take one of two versions of the Information subtest of the 
WAIS. One version was the standard scale (American version) while the second 
was “Canadianized” (Canadian version) by changing the eight items with American 
content as follows: 

(1) What are the colours of the Canadian flag? 

(6) Name four men who have been prime ministers of Canada since 1900. 

(9) In what direction would you travel if you went from Toronto to Panama? 

(11) How tall is the average Canadian woman? 

(14) When is Queen Victoria’s birthday? 

(17) How far is it from Paris to Toronto? 

(20) What is the population of Canada? 
(21) How many Members of Parliament are there in the House of Commons? 
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All other items were identical. Twenty-three subjects (48.9% of the sample) 
received the American version while 24 (51.1% of the sample) received the 
Canadian form. 

A one-way analysis of variance was performed on the total subtest scores and 
revealed no significant differences between the subjects’ performance on the 
Canadian or American versions of the subtest (F=0.703; df=1,45; p=0.4064). The 
mean score on the American version was 21.09 (SD=3.59) while on the Canadian 
version it was 20.13 (SD=4.24). An item analysis, however, demonstrated that 
subjects’ performance was significantly different (X°=14.34, df=1; p=0.0002) on 
item 6. Interestingly, on this item, subjects were able to name four past presidents 
of the United States more readily than they were able to name four prime 
ministers of Canada. 


Although this study was limited due to the relatively small number of subjects, 
it does give some indication that Canadian test-takers’ scores on the Information 
subtest of the WAIS may not be depressed due to the American content of eight 
items. 


Gurmal Rattan 
Morag Gornall 
and 
Claudio Violato 
The University of British Columbia 
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BOOK REVIEWS 


CANADIAN AND COMPARATIVE EDUCATIONAL ADMINISTRATION. 
Edited by Robin H. Farquhar and Ian E. Housego. Vancouver: University of 
British Columbia, Publications Division, Centre for Continuing Education, 1980, 
403 pp., $15.00 (Canadian). 


The Commonwealth Council for Educational Administration is an 
Australia— based organization which has initiated a series of international 
visitation programmes for scholars and others with an interest in the field of 
educational administration. These events have occurred at four-yearly intervals, in 
different parts of the world, and include, besides the rather opaque activity of 
visitation, the preparation and delivery of formal papers. This book is a 
compendium of papers from the 1978 Canadian Conference. It is well-edited, 
which is to say that it is heavily edited, and an attempt has been made to 
superimpose a logic by organizing the contributors under the heads of The 
Administrator as Mediator; Problems and Trends in the Field of Educational 
Administration; Contemporary Issues in Training Educational Administrators; 
Promises and Limitations; and The Future Study of Educational Administration. 
These sections are preceded by a prologue dealing with Leadership for 
Educational Administration in Canada and succeeded by an epilogue on The 
Contemporary Predicament. Of the 31 authors about a third are Canadian and the 
national allegiance of the rest would include not only the Commonwealth 
(Australia well represented) but also the United States, Europe, and Latin 
America. Motley perhaps, but certainly evidence of universality and motleyness in 
itself is a commendation rather than a criticism in a work of this_ kind. 
Unfortunately, some readers might be deterred by the range and variety of topic 
and focus which constantly serves to weaken the editors’ bid for thematic 
coherence. That coherence is supposed to be about “the contemporary role of the 
educational administrator as a mediator among various internal and external 
forces that influence the formulations and implementation of policy.” But such a 
theme is so broad, vague, even banal that it affords, to put it positively, the 
opportunity to talk about anything or everything of interest to practitioner or 
theoretician. There is just too much grist for the mill. 

Perhaps as a consequence of this, the articles are highly variable in style, 
content, and quality and it would be insidious and imprudent for a reviewer to 
single out specific authors for the receipt of praise, blame, or rank-ordering. It 
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must be noted, however, that many of the contributors bear names of eminence in 
the field and these include, without prejudice to colleagues not cited, William 
Taylor and Meredydd Hughes from the United Kingdom, William Walker and 
Grant Harman from Australia, Per Dalin from Norway, and Jack Culbertson and 
Jacob W. Getzels from the U.S. Taken together, the contents of the book provide 
an interesting and impressive collection of international concerns and issues in 
educational administration circa 1978. It thus constitutes a useful source book in 
comparative administration and justifies its title. At the same time it provides 
direct evidence of the fuzzy and amorphous nature of the discipline of 
administration and argues for the more correct classification of that study as an 
applied interdisciplinary nexus, a confluence into which many streams of 
information may pour and to which all of the social sciences and humanities can 
contribute. 

It may be a source of regret to some, though only an oblique criticism of the 
book, that the resounding theme which emerged from the predecessor conference 
— the intellectual controversy giving rise to the ongoing polemic between social 
science and phenomenology and all the exciting shot and shell of the 
Greenfield-Griffiths dialectic — was rather studiously ignored in the successor 
conference. As if the genie freed from the bottle before could now be subsequently 
discounted. Yet reference to that debate occurs nevertheless, even if en passant, 
in the papers of three of the contributors. 


In sum, this is a well-edited survey of contemporary issues in educational 
administration across the world. It is intellectually bland and non —- controversial. 
A valuable fund of information is presented but little to arouse and stir the 
mental appetites and passions; wisdom, if it is there, and it probably is, must be 
mined from the ore of pragmatism. The concept of administrator as mediator 
amidst a mess of contending circumstances and forces, not a very helpful concept 
to begin with, remains in less than sharp and crystalline focus at the end. But one 
can enhance one’s sophistication about the mess by reading this book. 


Christopher Hodgkinson 
Faculty of Education 
University of Victoria 


BEHAVIOUR THERAPY WITH HYPERACTIVE AND LEARNING 
DISABLED CHILDREN. By Benjamin B. Lahey. New York: Oxford University 
Press, 1979, 260 pp., $15.95 (hardbound), $8.95 (cloth). 


Perhaps no other area of exceptionality has generated as much controversy 
than has learning disabilities. Today, some 15 years after the term “learning 
disability” was coined, there is little agreement among professionals concerning 
either the nature of learning disabilities or effective methods of assessing and 
remediating learning and behavior problems. This continuing controversy is the 
subject of Lahey’s text, whose purpose is to examine the various theoretical 
approaches and to determine whether “we now possess a technology for effectively 
dealing with learning and behavior problems” (p. v). 
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The text consists of an introduction followed by three sections, each of which 
contains readings reprinted from journal articles. A brief historical overview of the 
field is provided in the introduction, in which the past 20 years are divided into 
three “eras.” Each era corresponds to a “paradigm shift” or a time when new 
conceptualizations or methodological advances significantly altered the way 
learning and behavior problems were viewed. 


In Section I, Lahey examines the research evidence on the effectiveness of 
treatment approaches stemming from first paradigm (perceptual-motor, 
psycholinguistic, and neurological) theories. Also included in this section are 
review articles on the effects of stimulant medication on hyperactive behavior. The 
second and third paradigm approaches toward the treatment of learning and 
behavior problems are presented in Section II. These two paradigms are similar in 
that they both focus on direct remediation of observable behavior problems; they 
differ with respect to the selection of the behavior to be modified. While the 
second paradigm focuses on reducing inappropriate behaviors which are thought to 
interfere with learning, the latter paradigm emphasizes reinforcing accurate 
responding to increase academic performance and to decrease inappropriate 
behaviors. Ten research articles representing second and third paradigm 
behavioral approaches are presented in this section, dealing with such topics as: 
direct instruction, cognitive behavior modification, biofeedback, and 
self-instruction. The final section of the text contains three articles which contrast 
the effects of stimulant medication and behavioral management techniques for 
controlling hyperactive behavior. 

Lahey presents a convincing case for abandoning first paradigm approaches, 
which emphasize remediating presumed underlying disabilities, in favor of a 
behavioral approach. His position is clearly supported by the lack of research 
attesting to the effectiveness of first paradigm treatment methods and on the 
consistently positive effects associated with the behavioral approach. Although 
Lahey advocates the use of a behavioral approach, he evaluates it with the same 
objectivity used to assess first paradigm theories. 

This text is by far the most authoritative source on the effectiveness of various 
approaches in the field of learning disabilities. It is both a welcome addition to 
one’s professional library and an excellent text for a graduate level course on 
theories of learning disabilities. 


Colleen S. Blankenship 
Department of Special Education 
University of Illinois at Urbana-Champaign 


THE DEVELOPMENT OF MODAL REASONING. Genesis of Necessity and 
Possibility Notions. By Gilberte Piéraut-Le Bonniec. New York: Academic Press, 


Inc., 1980, 171 pp., $16.50. 


The Development of Modal Reasoning is an experimental research essentially 
influenced by Piaget’s findings on the development of intelligence. Except for 
Chapter 4 which was rewritten because of new experimental data, the work is a 
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translation of “Le Raisonnement Modal: Etude Génétique.” In this work, 
Piéraut-Le Bonniec deals with “the genesis of this all-encompassing need for 
logical thought, and for this ability of thought to question itself” (p. ix) and how 
“children construct the notions that they use to organize their own thoughts into 
coherent systems of truth values” (p. 2). In essence, the author feels that while 
“Piaget has dealt primarily with the genesis of logicomathematical thought” (p. 2), 
“other forms of reasoning develop parallel to the logic of formal operations 
acquired in adolescence” (p. 2). One of these is the use of modal notions such as 
possibility and necessity, certainty and uncertainty, probability and impossibility. 
In this study we find a description of “the observable states in the child’s use of 
modal notions” (p. 3). An attempt is made to construct a model that would 
account for the organization of these notions in thought, but as the author puts it, 
the model which is hypothetical needs more time for validation. She explains it 
thus: 

The notion of eventuality arises from an initial system of pragmatic modalities. The 

notion of eventuality would be the point of attachment for the organization of the 

system of epistemic modalities to know that; to not know if. The notion of 
undecidability would result from the system of eventualities and the epistemic 
modalities. This notion in turn, would be the point of attachment for the construction 

of alethic modalities by coordinating the necessary and the hypothetical. (p. 153) 

In chapter 1, the author gives us a briefing of the different fields that have 
studied modal aspects of thought and, in so doing, presents elements giving us a 
more complete definition of “modalities.” We find ourselves reviewing some parts 
of syntax in the field of language, followed by classical and medieval studies of 
modality with the structural organization of modal notions: the square of 
oppositions; the difficulties of the implication relation, and the distinction 
necessary between “de re” and “de dicto” modalities. We are then introduced to 
mathematical logic. These modalities are grouped under a “question of alethic 
modality.” A section called Extensions of the notion of modality gives us a 
comparison of Alethic, Epistemic, Deontic, Existential, etc. modalities. The 
chapter concludes with the relationship of and difficulties with modality and 
quantification. 


Chapter 2 begins with a summary of chapter 1. After the summary, the 

concept of truth is elaborated, i.e., 
The sense of truth within man’s reach plays in fact a double role: on the one hand, the 
ability to discover truth relations permits action in both time and space; on the other 
hand, the search for evidence orients reasoning and the thought process. This could 
thus explain the unity underlying the various aspects of modality. (p. 41) 
Since philosophers had studied modality prior to the advent of psychology, the 
author looks at how they approached the different aspects, which brings her to 
reviewing some theories of knowledge (Descartes, Socrates, Kant, Locke). This is 
followed by the contribution of experimental psychology to the modal aspects of 
reasoning. Here the author identifies clearly genetic psychology as the only 
appropriate method for her research. She then summarizes Piaget’s stages of 
intellectual development and criticizes the theory, mainly the implication 
problems in the INRC groups. 

One comment made by the author, “The use of two-valued logic as a model in 
psychology precludes an explanation of important aspects of natural thought” 
(p. 56), leads to the following question: “Can modal logic furnish a better 
descriptive model of the operations of thought?” At this point she introduces L. 
Frey’s analysis of Piaget’s and Inhelder’s, “the billiard games.” 

We are now ready for the author’s hypotheses: 

1. The important role played by “possibility” and the existence of a form of 
hypotheticodeductive thought before adolescence. 
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2. That modal logic would be mode appropriate as a means of understanding 
the acquisition of logical necessity. 

3. The sentiments of logical necessity could arise from the simultaneous 
consideration of observable cases of possibility and impossibility. 

The model used is based on the logical square of modalities including those 

modalities concerned with the truth value propositions encountered in linguistics 

and semantics. 


Chapters 3, 4, and 5 describe the experiments used to verify these hypotheses 
and the conclusions obtained. The subjects for these experiments were chosen 
according to the three stages of intellectual development defined by Piaget. The 
experiments are presented in two parts: Part I — presentation of material which 
could be used to make certain objects; Part II — objects made from the material. 


It is difficult to evaluate the experiments performed since the author does not 
say how the subjects were chosen, i.e., what the criteria were regarding deciding 
what stage a child was really at, if the cultural background of the child was an 
important factor or if this was random choice, etc. The number of students for 
each experiment also varies, which leads us to believe that different subjects were 
chosen for each experiment. 


This would appear to be an interesting study for those involved in the field of 
Developmental Psychology. The author has answered some of the questions asked 
at the beginning of this text, such as the importance of necessity and possibility in 
the solution of problems. She has also opened the path for more research in this 
area. Even though her model is still in a developmental process, it provides a tool 
for future research. This study also offers some answers to Piaget’s critics. As 
Piéraut-Le Bonniec suggests, a new model could be developed to explain the 
transformation observed in the genesis of the modalities of reasoning. 

In the conclusion, Chapter 6, the author realizes that in the organization and 
utilization of modal notions there is an order which refers to the law governing the 
different ages and stages. (Each stage constitutes a necessary if not sufficient 
condition for the advent of the succeeding stage.) 

This is an interesting book. Piéraut-Le Bonniec’s writing style upholds interest 
in what appears to be a complicated subject. However, the observations described 
are acts of everyday living which are simple and might leave the reader with the 
question: Was something new discovered in this study? 
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